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G -PROTEIN COUPLED RECEPTORS 



TECHNICAL FIELD 
This Invention rdates to nucleic add and amino add seqmots of C-procdn coupled receptors 
and to the use of Owe sequences in the diagnosis treatmax. and prevention of cefl proliferative, 
neurological, cardiovascular, gastrointestinal, awauiOTjne/unTariimatory. and mrlafrrtic disorders, and 
viral infections, and in foe assessment of die effects of exogenous compounds on the expression of 



BACKGROUND OF THE INVENTION 
Signal transduction fa the general process by which cetfa respond to extracellular signals. 
Signal transduction across the plasma membrane begins with the blading of a signal molecule, eg., ■ 

er, or growth factor, to a cell caembrane receptor. The receptor, thus activated, 
le that ends witih the acthralion of an intracellular target 
13 molecule, such as a transcription factor. This process of signal transduction regulates all types of cell 
functions Including ceO proliferation. Differentiation, and gene transcription. The G-protein coupled 
receptors (GPCRs). ercoded by one of the largest, families of genes yet identified, play a central role in 
the transduction of extracellular signals across the plasma membrane. GPCRs have a proven history of 
being successful therapeutic targets. 
20 GPCRs arc integral membrane proteins characterized by the presence of seven hytrophobic 

transmembrane domains which together form a bundle of antiporalld alpha (a) helices. GPCRs range 
In size from under 400 to over 1000 amino adds (Strosberg. A.D. (1991) Eur. J. Biocbem. 196:1 -10; 
CougWin, S.R. (1994) Curr. Opin. Cell BioL 6:191-197). The amino-ierminus of a GPCR is 
extracellular, is of variable length, and Is often glycosylated. The cartxwy-terminus is cytoplasmic and 
25 generally phosphorylatcd. Extracellular loops alternate with iztracdhilar loops and link the 

transrnembrane dooiains. Cysteine disulfide bridges linking the second and third extracellular loops 
may interact with agonists and antagonists. The most conserved domains of GPCRs are the 
transmembrane domains and the lim two cytoplasmic loops. TheU 
part, for structural and ftinctional features of the receptor. Inn 
30 a ligand-binctlng pocket. The extracellular N-termina] segmera. or one or more of the three extraceilulsr 
loops, may also participate tn tigand binding. Ligand binding activates the receptor by inducing a 
confcrmatiofial change in intracellular portions of the receptor, tn mm. Ore large, third intracellular 
loop of the activated receptor interacts with a heterotrimeric guanine nucleotide binding (G) protein 
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messengers such as cyclic AMP (cAMP), phosphctipase C, and Inositol triphosphate, and the 
interaction of the activated GPCR with km channel proteins. (See, eg., Watson, S. and S. Aridnstall 
(1994) The Q-prptti" Linked, Receptor Factf ppok. Academic Press. San Diego CA, pp. 2-6; 
Bolander. F.F. (1994) Molecular Endocrinology. Axademic Press. San Diego CA, pp. 162-176; 
S Baldwin. )M. (1994) Curr. Opin. Cell Biol. 6:1 80-190.) 

GPCRs include receptors for sensory signal mediators (eg,, light and olfactory stimulatory 
molecules); adenosine, y-aminobutyric add (GAB AX hepatocyte growth factor, melanoconins, 
neuropeptide Y, opioid peptides, opsins, somatostatin, tachykinins, vasoactive intestinal polypeptide 
family, and vasopressin; biogenic amines (eg. dopamine, epinephrine and norepinephrine, histamine. 
10 glutamaie (nwabouopic effect), acetylcholine (muscarinic effect), and serotonin); chemoklncs; lipid 
s of inflammation (eg. prostaglandins and prostanoids, platelet activating factor, and 
es); and peptide hormones (eg, bo m b e si n , bradykintn. calcitonin, CSa anapbylatoxin, 

e (FSH), gonadotropic-rdcasing hormone (GnRH), neurokinin, 
g hormone (TRH), and oxytocin). GPCRs which act as receptors for stimuli 
that have yet to be identified are known as orphan receptors. 

The diversity of the GPCR family is further increased by alternative splicing. Many GPCR 
genes contain introns. and there are currently over 30 such receptors for which splice variants have been 
Identified. The largest number of variations are at the protein C-terrninus. N-termina] and cytoplasmic 
loop variants are also frequent, while variants in the extracellular loops or Oransmembrane domains are 
less common. Some receptors have more than one site at which variance can occur. The spacing 
variants appear to be functionally distinct, based upon observed differences in distribution, signaling, 
coupling, regulation, and ligand binding proGles (Kirpatrick. GJ. el at. (1999) Trends Pharmacol. Sd. 
20:294-301). 

GPCRs can be divided into three major subfamilies: the rhodopsin-like, secretin- U kn, and 
metabotropfc grotamate receptor sub famili e s. Members of these GPCR subfamilies share similar 




- The largest family consists of the rhodopsin-like GPCRs, which transmit diverse 
extracellular signals including hormones, neurotransmitters, and light. Rhodopsin is a photosensitive 
GPCR found in animal retinas. In verte b r at e s , rhodopsin molecules are embedded in membr a nous 
stacks found in photoreceptor (rod) ceils. Each rhodopsin molecule responds to a phot an of light by 
triggering a decrease in cGMP levels which leads to the closure of plasma membrane sodium channels. 
In this manner, a visual signal fat co n ver te d to a neural Impulse Other rhodopsin-like GPCRs are 
directly involved in responding to namxransmftlers. These GPCRs Include the receptors f 
(adrenergic receptors), acetylcholine (muscarinic receptors), adenosine, galanin, and glutamate (N- 



niethyl-D-aspartate/NMDA receptors). (Reviewed in Watson, S. and S. Aricinstall (1994) The G- 
Protctn United Receptor Facts Book. Academic Press. San Diego CA. pp. 7-9, 19-22. 32-35. 130-131, 
214-216, 221-222; Habert-Grtoli. E. et al. (1994) Proa Natl. Acad. Sd. USA 915780-9783.) 

The galanin receptors mediate the activity of the m 
Inhibits secretion of insulin, acetylcholine, serotonin and n 
growth hormone release. Galanin receptors are involved tt 
Alzheimer's disease (Kask, K. et al. (1997) Life ScL 60. 1523- 1533). Other nervous system rhodopshv 
■Uke GPCRs indude a growing family of receptors for lysrpfiospbatidic tod and other 
lysophospfaoiiptds. which appear to have rotes In development and nrajnspathology (Chun, I. et al. 
(1999) Cell Biochem. Bicphys. 30:213-242). 

The largest subfamily of GPCRs, the olfactory receptors, are also members of the rhodopsin- 
like GPCR family.. These receptors function by transducing odorani signals. Numerous distinct 



only one type of olfactory receptor, and distinct spatial zones of w 
are found in nasal passages. For example, the RAlc receptor which was isolated from a rat brain 
library, has been shown to be limited in expression to very dfatind regions of the brain and a defined 
rone of the olfactory epithelium (Ranting, K. et al. (1998) Receptors Channels 6:141-151). However, 
the expression of olfactory-like receptors is not confined to otfactcry tissues. For example, three rat 
genes enco d ing dfactary-like receptors having typical GPCR characteristics showed expression 
patterns not only in taste and olfactory tissue, but also in male reproductive tissue (Thomas, M.B. et al. 
(1996) Gene 178:1-5). 

Members of" the secretin- like GPCR subfamily have as their ligands peptide hcrmones such as 
secretin, calcitonin, glucagon, growth Ixxancaso-rdeasing hormone, parathyroid hormone, and 
vasoactive intestinal peptide. For example, the secretin receptor responds to secretin, a peptide 



I. PP- 278-283). Secretin receptors are about 450 amino adds in length and are found in the 
plasma me m brane of gastrointestinal ceils. Binding of secretin to its re t xpto r sti m ula t es the production 
ofcAMP. 

» (Emrl) and CD97 receptor 
proteins. These GPCRs are members of the recently characterized EGF-TM7 receptors subfamily. 
These seven transmembrane hormone receptors exist as be tero d ts neni in vivo and contain between 
three and seven potential caldum-binding EGF-like motifs. CD97 is predominantly exp ress ed in 
leukocytes and is markedly upregulated on activated B and T cells (Mc Knight, A_J. and S. < 
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(1998) j. Leukoc. Bkd. 63:77 1-280). 

The third GPCR tub f unity is the metabotropje gluts male receptor family. Gfutamate Is Ih6 
major excitatory ana ou ansmitter in the centra) nervous system. The metabotropic glutamate recept o rs 
modulate the activity of intracellular effectors, and are Involved in long-term potentiation (Watson. 
5 supra. p.l30\ The Ce* -sensing receptor, which senses changes in the extracellular concentration of 
calcium ions, has a large eatracetmlar domain including dusters of addae amino adds winch may be 
invorved in calcium binding. The metabotropic ghitamate receptor famOy abb Includes pheromone 
receptors, the CAB A, receptors, and the taste receptors. 

Other subfamilies of GPCRs include two groaps of chernoreceptor genes found in the 

10 nematodes Caencrhabdltis efeeans and Caenorhabdltis britecac, watch are distant] v related to the 

mammalian olfactory receptor genes. The yeast pherornone receptors STE2 and STE3, involved in the 
response to mating factors on the ceil membrane, have their own seven-lraramerobrane signature, as do 
the cAMP receptors from the slime mold Dtovcxtrthrm tftsefwdeorn, which are thought to regulate the 
aggregation of individual ceils and control the expression of numerous devctop m e nt a l ry-reg ul ated genes, 

15 GPCR mutations, which may cause loss of function or constitutive activation, nave been 



For instance, ■ 



s in the rhodopsin gi 



s pigmentosa may 
the 



thyrotropin, receptor have been reported to cause hyperfunc tion i n g thyroid adenomas, suggesting that 
certain GPCRs susceptible to constitutive activation may behave as pnxc*jncogene* (Parma, J. et al. 

20 (1993) Nature 365:649-651). GPCR. receptors for the following ligands also contain mutations 

associated with human disease? luteinizing hormone (precocious puberty}; vasopressin V 3 (X -linked 
nephrogenic diabetes); glucagon (diabetes and hypertension); calcium (hyperparathyroidism, 
hypocalcuria. hypercalcemia); parathyroid hormone (short limbed dwarfism): ^adrenoceptor (obesity, 
ncsMnsu tin-dependent diabetes tndlitus); growth hormone releasing hormone (dwarfism); and 

25 tidmiocorticorjopm (giuc^ 

1392; SUdd, J.M et al. (1997) Trends Pharmacol. Sd. 18:430437). GPCRs are also involved in 
depression, schizophrenia, sleepless n ess, hypertension, anxiety, stress, renal failure, and several 
cardiovascular disorders (Horn, F. and G. Vrfend (1998)1. Md. Med. 76:464468). 

In adc5 Hon, within the past 20 years several hundred new drugs have been recognized that are 

30 " directed towards activating or i nh i biti ng GPCRs. The therapeutic targets of these drugs span a wide 
range of diseases and disorders. Including cardiovascu la r, gastrointestinal, and central nervous system 
disorders as wdl as cancer, osteoporosis and endometricgfal (Wilson, supra: Stadd. supra) . For 
example, the dopamine agonist L-dopa Is used to treat Parkinson's di s e ase, while a dopamine antagonist 
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adrenoceptors have been used for ti 



it of asthma, high blood pressure, other car diovas c u la r 
e used rn the treatment of glauc o ma and tachycardia; 
serotonin 5 HT1 D antagonists are used against migraine; and histamine HI antagonists are used against 
allergic and anaphylactic reactions, hay fever, itching, and motion sickness (Horn, suoral 
5 Recent research suggests potential future therapeutic uses for GPCRs in the treatment of 

metabolic disorders including diabetes, obesity, and osteoporosis. For example, mutant V2 vasopressin 
receptors causing nephrogenic diabetes could be functionally rescued [fl vjtr o by co-expression of a C- 
tenrdnal V2 receptor peptide spanning the region containing the mutations. This result suggests a 
possible novel strategy for disease treatment (Scbooeberg. T. et al. (1996) EM BO J. 15:1283-1291). 
10 Mutations In melanooortin-4 receptor (MC4R) are implicated In human weight regulation and obesity. 
As with the vasopressin V2 receptor mutants, rhese MC4R mutants are defective in trafficking to the 
e(Ho. G. and R.G. MacKenzic ( 1 999) J. BioL Cbem. 274:35816-35822). and thus 
h a similar strategy. The type 1 receptor for parathyroid hormone (PTH) is a 
GPCR that mediates the PTTI-ctependent regulation of calcium horneostasij in the bkxxtstream. Study 
15 of PTH/receptor interactions may enable the development of norvel PTH receptor ligands for the 
treatment of osteoporosis (Mannstadu M. et al. (1999) Am. J. Physiol. 277:F665-F675). 

The chemokine receptor group of GPCRs have potential therapeutic utility In inflammation and 
infectious disease. (For review, sceEocati, M. and P.M. Murphy (1999) Annu. Rev. Med. 50:425- 
440.) Chernokines are small polypeptides thai act as intracellular signals in the regulation of leucocyte 
20 trafficking, hemMopotesit, and angiogenesis. TargcTed disruption of various cfonokine receptors in 
trace Indicates that these receptors play roles in pathologic inflammation and In autoimmune disorders 
such os multiple sclerosis. Cheznokine receptors are also exploited by infectious agents, including 
herpesviruses and the human inununooeficiency virus (H1V-1) to faciljute infection. A truncated 
version of chemokine receptor CCR5. which acts as a coreceptor for infection of T-cdlsby HIV-1, 
25 results in resistance to AIDS, suggesting that CCR5 antagonists could be useful in preventing the 
devdopment of AIDS. 

The discovery of new G -protein coupled receptors and the pdynudeotides encoding them 
satisfies a need in the art by providing new compositions winch are riseful in the diagnosis, prevention, 



it of the 



protein coupled receptors. 



SUMMARY OF THE INVENTION 
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The Invention features purified polypeptides, G-protdn coupled receptors, referred to 
collectively as "GCREC" and individually as "GCREC-l." "GCREC-2," "GCREC-3." "GCREC-4," 
"GCREC-5.- "CCREC-6," "GCREC-7 - -GCREC-8.'* "CCREC-9." "GCREC- 10." "GCREC-l 1 
"GCREC-l 2," "GCREC-l 3," "GCREC-14.- "GCREC-l 5." "GCREC-l 6," "GCREC-l 7," "GCREC- 
18." "GCREC- 19," "GCREC-20." and "GCREC-21.'* In one aspect, the invention provides an isolated 

sequence selected from die group consisting of SEQ ID NO: 1-21. b) a naturally occurring amino add 
sequence having at least 90% sequence identity to an amino add sequence selected from the group 
g of SEQ LD NO: 1-21, c) a biologically active fragment of an amino add u 
g of SEQ ID NOI-21. and d) an ii 
sequence selected from the group consisting of SEQ ID NO: 1-21. Ino 
provides an isolated polypeptide comprising the amino add sequence of SEQ ID NO: 1-21 . 

The Invention further provides an isolated pdynudeoUde encoding a polypeptide comprising an 
3 co n sist in g of a) an amino add sequence teaeded from the 
naturally occurring amino add sequence having at least 
90% sequence Identity to an amino add sequence sele c t e d from the group consisting of SEQ ID NO: 1 - 
21. c) a bidogically active fragment of an amino add sequence selected from the group consisting of 
SEQ ID NO 1-21, and d) an immunogenic fragment of an amino add sequence selected from rhe group 
consisting of SEQ ID NO:l-21. In one alternative, the pdynudeotide encodes a polypeptide selected 
from the group consisting of SEQ ID NO: 1-21. In another alternative, the porvnodeotide is selected 
from the group consisting of SEQ ID N022-42 

Additionally, the invention provides a recombinant poly nu cl e otide comprising a promoter 



IS group consisting of SEQ ID NO: 1-21. b) a 



sequence selected from the group consisting of a) an amino add sequence selected from the group 
consisting of SEQ ID NO.1-21, b) a naturally occurring amino add sequence having at least 90% 
sequence identity to an amino add sequence selected from the group consisting of SEQ ID NO.1-21. c) 
a biologically active fragment of an amino add sequence s el e ct e d from the group consisting of SEQ ID 
NO.1-21, and d) an immunogenic fragment ot* an amino add sequence sctecred from the group 
consisting of SEQ ID NO 1-21. In one alternative, the invention provides a cdl transformed whh the 
recombinant polynucleotide. In another alternative, the invention provides a transgenic organism 
comprising me recombinant polynucleotide. 

The Invention also provides a method for producing a polypeptide comprising an amino add 
sequence selected from Ihe group consisting of a) an amino add sequence selected from the group 
consisting of SEQ ID NO: I -21 , b) a naturally occurring amino add sequence having at least 90% 
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sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l-21. c) 
a bidogically active fragment of an amino add sequence selected from the group consisting of SEQ ID 
NO: I -21, and d) an Immunogenic fragment of an amino add sequence selected from the group 
consisting of SEQ ID NO:1-21. The method compr is es a) culturing a cdl under conditions suitable for 
5 expression of the polypeptide, wherein said cdl is transformed with a recombinant pdyuudecocte 
comprising a promoter teqoeoce opera biy linked to a polynucleotide encoding the pdypeptide, and b) 
recovering the polypeptide so expressed. 

Additionally, the invention provides ah isolated antibody which specifically binds to a 
pdypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 

10 sequence selected from the group consisting of SEQ ID NO:l-21, b) a naturally occurring amino add 
sequence having at least 90% sequence identity to an amino add sequence selected from the group 
consisting of SEQ ID NO.1-21. c) a biological ty active fragment of an amino add u 
from the group consisting of SEQ ID NO 1 -2 1 , and d) an n 
sequence selected from the group consisting of SEQ ID NO.1-21. 

15 The invention further provides an isolated pdynndeotide co m p ri si n g a poly nu cleotide se qu en ce 

selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:22-42, b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:22-42, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b). and e) 

20 an RNA equivalent of a>d). In one alternative, the polyiaideotide co mp ris e s at least 60 contiguous 



Additionally, rhe invention provides a method for detecti n g a target polynucleotide la a sample, 
laid tar go porynudeotide laving a s^ 

sdected from the group consisting of a) a porynudeotide sequence sel e cte d from the group con s is ting of 
25 SEQ ID NO.22-42, b) a naturally occurring porynudeotide sequence having at least 90% sequence 
d from the group consisting of SEQ ID NO.22-42, c) a 
y to a), d) a porynudeotide sequence co m ple ment ary to b), and e) 
an RN A equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising 



30 In rhe sample , and which probe spe cifical ly hybridizes to said target pol yn ucleotide, ui 

whereby a hybridization complex is formed between said probe and said target polynucleo tid e or 
fragments thereof, and b) drtrfting the pr esen c e or ab senc e of* said hybridization complex, and 
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 



6 



7 



The invention further provides a method for detect i ng a 



mi 
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In a sample said 



target polynucleotide having i sequence of ■ polynucleotide comprising a polynucleotide » 
detected from the group coosiaing of a) a polynucleotide s eq ue n c e selected from the group consisting of 
SEQ ID N 0:22-42, b) a mtuiMly occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID N 0:22-42, c) a 
pofynuckoride sequence complementary to a), d) a polynucleotide sequence complementary to b). and e) 
an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or 
fragment thereof using polymerase chain reaction amplification, and b) delecting the presence or 
absence of said amplified target polynucleotide or fragment thereof , and, optionally, if present, the 



The invention further provides a composition comprising an effective amount of a polypeptide 

selected from the group consisting of SEQ ID NO : 1-21. b) a naturally occurring a 
having si least 90% sequence identity to an a 
SEQ ID NO:l-21. c) a biologically active fragment of* an amino add sequence selected from the group 
consisting of SEQ ID NO:l-21. and d) an immunogenic fragment of an amino add sequence selected 
from the group consisting of SEQ ID NO: 1-21 , and a phartnaccuticatly acceptable exdpient In one 
embodiment, the composition comprises an amino add sequence selected from the group consisting of 
SEQ ID NO: 1-21. The invention additionally provides a method Of treating a disease or condition 
associated with decreased expression of functional GCREC, comprising administering to a patient in 
need of such treatment the composition. 

The Invention also provides a method for screening a compound for effectiveness as an 

x, selected from the group consisting of a) an 
a from the group consisting of SEQ ID NO: 1 -2 1 , b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-21, c) a biologically active fragment of an amino add sequence 
selected from the group consisting of SEQ ID NO: 1-21 , andd) an immunogenic fragment of an amino 
add sequence selected from the group consisting of SEQ ID NO: 1-21. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity In the 
sample. In one alternative, the invention provides a composition comprising an agonist compound 
identified by the method and a pharmaceutically acceptable exdpient. In another alternative, the 
invention provides a method of treating a disease or condition associated with decreased expression of 
functional GCREC, comprising administering to a patient in need of such treatment the composition. 
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of a) an amino add sequence sdected from Che group consisting of SEQ ID N 0:1-21, 1 
occurring annuo add sequence having at least 90% sequence identity to an amino add sequence 
selected from the group consisting of SEQ ID NO: 1-2 1, c) a biologically active fragment of an amino 
add sequence selected from the group consisting of SEQ ID NO: 1-21. and d) an immunogenic fragment 
of an amino acid sequence selected from the group consisting of SEQ ID NO. 1-21. The method 
comprises a) exposing a cample comprising the polypeptide to a compound, and b) detecting . 
antagonist activity in the sample- In one alternative, the invention provides a composition comprising 
an antagonist compound identified by the method and a pharmaceuticaily acceptable exdpient. In 
another alternative, the invention provides a method of treating a casease or condition associated with 
overexpression of functional GCREC, comprising administering to a patient in need of such treatment 



The invention further provides a method of screening for a compound that spedficalty binds 
to a polypeptide comprising an amino add sequence selected from the group consisting of a) an amino 
acid sequence selexaed from the group consisting of SEQ ID NO:1-21,b) a ruturally occurring amino 
add sequence having at least 90% sequence identity to an amino add sequence selected from the group 
consisting of SEQ ID NO: 1-21. c) a biologically active fragment of an amino add sequence selected 
from the group consisting of SEQ. ID NO. 1-21. and d) an immunogenic fragment of an amino add 
sequence selected from the group consisting of SEQ ID NO: 1-21. The method comprises a) combining 
the polypeptide with at least one test compound under suitable conditions, and b) detecting binding 
or the polypeptide to the lest compound, thereby identifying a compound that spedflcally binds to the 
polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino add sequence selected from the group consisting of a) an 
amino add sequence selected from the group consisting of SEQ ID NO:l-21,b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino add sequence selected from the 
group consisting of SEQ ID NO: 1-21, c) a biologically active fragment of an amino add sequence 
selected from the group consisting of SEQ ID NO: 1-21, and d) an immunogenic fragment of an amino 
. acid sequence selected from the group consisting of SEQ ID NO:l-2l. The method comprises a) 
combining the polypeptide with at least one test compound under conditions permissive for the 
activity of the polypeptide, b) assessing the activity of the polypeptide in the presence of the test 
compound, and c) comparing the activity of the polypeptide in the presence of the test compound with 
the activity of tbe polypeptide In the absence of the test compound, wherein a change in the activity of 
the polypeptide in the presence of the test compound is indicative of a compound that modulates the 
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activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO: 22-4 2 the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

The invention further provides a method for assessing toxidty of a test compound, said 
method comprising a) treating a biological sample containing nucleic adds with the test compound: 
b) hybridizing the nudelc adds of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynudeotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynudeotide sequence selected from the group consisting of SEQ ID 
NO:22-42, ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to 
a polynudeotide sequence selected from the group consisting of SEQ ID NO: 22-42, iii) a 
polynudeotide sequence complementary to IX iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i>-iv). Hybridization occurs under conditions whereby a sped fie 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynudeotide comprising a polynudeotide sequence sdected from the group 
consisting of 1) a polynucleotide sequence selected from tbe group consisting of SEQ ID NO:22-42, 
ii) a naturally occurring polynudeotide sequence having at least 90% sequence identity to a 
polynudeotide sequence selected from the group consisting of SEQ ID NO:22-42, iii) a 
polynucleotide .sequence complementary to I), lv) a polynucleotide sequence complementary to ii}. 
and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment of 
a polynudeotide sequence sdected from the group consisting of i)-v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amount of hybridization complex In the 
treated biological sample with tbe amount of hybridization complex in an untreated biological 
sample, wherein a difference in the amount of hybridization complex in the treated biological sample 
is Indicative of toxicity of the lest compound. 

BRIEF DESCRIPTION OF THE TABLES 

;tb polynucleotide and polypeptide 



Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
homolog for polypeptides of the invention. The probaWHiy score for the match between each 
polypeptide and lis GenBank homolog is also shown. 
33 Table 3 Shows structural features of polypeptide sequences of the invention. Including predicted 
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motifs and domains, along with the methods, algorithms, and searchable databases used for analysis of 
the polypeptides. 

Table 4 lists the cDNA and genomic DNA fragments which were used to assemble 

5 of the invention, along with selected fragments of the polynudeotide 



Table 5 shows the representative cDNA library for polynucleotides of the invention. 
Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
10 polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 
Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
IS may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in tbe appended claims, the singular forms V "an," 
and "the" include plural reference unless the context dearly dictates otherwise. Thus, for example, a 
20 reference to "a host cell" Includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this Invention belongs. Although 

25 any machines, m a t erials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described. 
All publications mentioned herein are dted for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in tbe publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 

30 not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"GCREC" refers to the amino add sequences of substantially purified GCREC obtained from 
any species, particularly a mammalian spedes, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 
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The terra "lgomsT refers to a motecnte which intensifies or minu^P^iutoglca] activity of 
OCR EC. Agonists may indode proteins, uuctec adds, caxborrydrates, small molecules, or my other 
canpound cr composition which rncdubrles the »cUvityof GCRECdther by directly imeraalng with 
GCREC or by acting on componen ts of the biological pathway in which GCREC partidpaies. 

5 An "allelic voriarfl" b an alternative form of the gene encoding GCREC. Allelic variants may 

result from at least one mutation in the nuctek add sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none. one. or 
many aHeSc variants of its naturally occurring form. Common mutational chang e s which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 

10 Each erf these lypa of changrs may occur alone, or in combination with the others, one rr nwre limes In 
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Altered" nucleic add sequences encoding GCREC include those sequences with deletions. 

Jtutiom of different nucleotides, resulting in a polypeptide the same as GCREC or a 
polypeptide with at least one functional characteristic of GCREC. lncruded within this definhJon are 
15 porymcrphisins which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding GCREC. and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding GCREC. 
Trie encoded protein may also be "altered." and may contain deletions, insertions, or substitutions of 
ami*) acid residues which produce a silent change and result in a functionally equivalent GCREC 
20 Deliberate amino add substitutions may be made nn ihc basis of similarity in polarity, charge, 

sotubiliry. rrydropriobictty, hydropbilicity, antVor the amphipathic nature of the residues, as long as the 
biological or Immunological activity of GCREC is retained. Fur example, negatively charged amino 
odds may include asportic acid and glutamic acid, and positively charged amino acids may include 
lysine and argiruhe. Amino adds with uncharged polar side chains having similar hydrophJlldty 
25 values may include: asparagj ne and glut amine; and serine and threonine. Amino acids with 

uncharged ride chains having similar hydropbilicity values rosy include: leucine, iscleudne. and 
valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino add" and "amino add sequence" refer to an oligopeptide, peptide, 
polypeptide, or protdn sequence, or a fragment of any of these, and to naturally occurring or synthetic 
30 molecules Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino add sequence 
to the complete native amino acid sequence associated with the recited protein molecule, 

" Amplification" relates to the production of additional copies of a nucleic acid sequence, 
e b generally carried out using polymerase chain reaction (PGR) it 



in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
GCREC. Antagonists may include proteins such as antibodies, n uddc adds, carbohydrates, small 
mote rules, or any other compound or composition which modulates the activity of GCREC either by 
3 directly interacting with GCREC or by acting on components of the bkflogteal pathway in which 
GCREC participates. 

The term "antibody" refers to intact trnmunoglabulin molecules as wen as to fragments thereof. 

such as Fab, F(«b% and Fv fragments, which are capable of binding an epdlccric determinant. 

Antibodies that bind GCREC polypeptides can be prepared using intact polypeptides or using 
10 fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 

oligopeptide used to trnmunfae an animal (eg. a mouse, a rat, or a rabbit) can be derived from the 

translation of RN A, or synthesized chemically, and can be conjugated to a carrier protein If desired. 

Commonly used carriers that are chemically coupled to peptides Include bovine serum albumin. 

thyreoglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
IS the animal. 

The term "antigenic determinant" refers to that region of a molecule (I.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

s regions of the protein may Induce the production of a n t ib odies which 
nninanfs (particular regions or three-dimensional structures on the 
20 protein). An antigenic derermteant may compete with me intact antigen (1-C the i mmu oogen used to 
didt the immune response) for binding to an antibody. 

The term "antisense" refers to any cornposilioo capable of base-pairing with the "sense" 
(coding) strand of a specific nuddc add sequence. Anflsense compo si tions may include DMA; RNA; 
peptide nuddc add (PNA); oUgonudeotides having modified backbone linkages such as 
25 ptosphonxhioates, metiiytphosphonates, or benzylptesphonatcs; otlgornideotides having modified 
sugar groups such as 7-methaxyetbyl sugars or 7-mctbOKvcthoxy sugars; or oUgonudeotides baring 
modified bases such as 5 -methyl cytosine, 7-deoxyuracfl, or 7-oeaza-r-deoxyguancsine Antisense 
molecules may be produced by any method Including c h e mic al synthesis or transcription. Once 
introduced into a cell, the complementary a n tise n se molecule base-pairs with a naturally occurring 
30 rtudetc add sequence produced by the oefJ to form duplexes which block ei 



•e strand of a reference DNA molecule 
The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule Likewise, "immunologically active" or "inununogenic" 
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refers to the capability of the natural, recombinant, or synthetic GCREC, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 



o ringh 



5 sequences that anneal by base-pairing. For example. 5 -AGT-3' pairs with its complemea, 
3-TCA-5*. 

A "composition comprising a given polynucleotide sequencer and a "composition comprising a 
given amino add sequence" refer broadly to any composition containing the given polynucleotide or 
amino add sequence. The composition may comprise a dry formulation or an aqueous solution. 
10 Compositions comprising polynucleotide sequences encoding GCREC cm fragments of GCREC may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
a carbohydrate. In hybridizations, the probe may be 
alts (eg., NaCl). detergents (eg. sodium dodecyl sulfate; 
is (eg, Denhardt's solution, dry milk, salmon sperm DNA. etc.). 
15 "Consensus se q u e nce " refers to a nud cic add sequence which has been subjected to repeated 

DNA sequence analysis to resolve uncalled bases, extended using the XE-PCR kit (Applied Biosystems, 
Foster City CA) in the 5* and/or the 3' direction, and resequenccd. or which has been a ssembl ed from 
one or more overlapping cDN A EST. or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WT) or Phrap 
20 (University of Washington, Seattle WA). Some seq ue nce s have been both extended and assembled to 



"Conservative a 



" are those substirutions that are predicted to least 



interfere with the properties of the original protein, i.e. the structure and espedalry the function of the 
protein to conserved and not significantly changed by such substitutions. The table below shows amino 
adds which may be substituted for an original amino add In a protdn and which are regarded as 
conservative amino acid substihitions. 



Original Residue Conservative Subst) 


Ala 


Gly.Ser 


Arg 


Hls.Lys 


Asn 


Asp. Gin, His 


Asp 


Asn. Glu 


Cys 


Ala, Scr 


Gin 


Asn.Gru.HiS 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn. Arg, Gin, Ghj 


ne 


Leu. Vol 



Lys 
Met 
Phe 
Ser 
Thr 
Trp 
Tyr 
Val 



He, Val 
Arg, Gin, Glu 
Leu, He 

His. Met, Leu, Trp, Tyr 

Cys,Thr 

Scr.Val 

Phe, Tyr 

His, Phe, Trp 

lie. Leu.Thr 



Conservative amino acid nibstitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the mbstitution, for example, as a beta sheet cr alpha helical conformation, 
(b) the charge or hydropbobidry of the molecule at the site of the substitution, anOVor (c) the bulk of the 
side chain. 

15 A "ddetion" refers to a change in the amino add cr nucleotide sequence that results in the 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a poly n ucleotide can include, for example , rep lacem ent of hydrogen by an alkyl. acyt, 
hydroxys, or amino group. A derivative polynu cl eotide encodes a polypeptide which ret a i n s at least one 

20 bidogical or mrmunologicaJ function of the natural mokcule A derivative polypeptide to one modified 
by glycosylation, pegylation, or any similar process (hat retains at least one biological or immunological 
function of the polypeptide from which it was derived. 

A "detectable labor refers to a reporter molecule or enzyme that b capable of generating a 
measurable signal and is covalertly or noncovalentfy jdned to a polynucleotide or polypeptide. 

25 A "fragment" is a unique portion of GCREC or the polynucleotide encoding GCREC which 

to Identical in sequence to but shorter Id length than the parent sequence A fragment may comprise 
up to the entire length of the defined sequence, minus one nucleotide/amino add residue. For 
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino add residues. A 
fragment used as a probe, primer, antigen, therapeutic molecule, or f or other purposes, may be at least 

30 5. 10. 15,16,20,25,30.40,50,60.75, 100, 150.250or at least 500 comigtwus itodeotidex or 
amino add residues In length. Fragments may be preferentially selected from certain regions of a 
molecule. For e x a m ple a polypeptide fragment may comprise a certain length of contiguous amino 
adds selected from the first 250 or 500 amino adds (or first 25* or 50%) of a polypeptide as shown 
in a certain defined sequence. Clearly these lengths are exemplary, and any length that is supported 

35 by the specification, including the Sequence Listing, tames, and figures, may be encompassed by the 



A fragment of SEQ ID NO.22-42 comprises a region of unique polynucleotide sequence that 
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specifically identifies SEQ ID NO. 22-4 2, for cum pic, u distinct in^ppCxher sequence in the 
genome from which die fragment was obtained. A fragment of SEQ ID NO: 2 2 -4 2 is useful, for 
example, in hybridization and amplification technologies and In analogous methods that distinguish 
SEQ ID NO:22-42 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO: 22-42 and the region of SEQ ID NO: 22-42 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the an based on the intended purpose for the fragment. 

A fragment of SEQ ID NO:l -21 is encoded by a fragment of SEQ ID NO:22-42. A fragment 
of SEQ ID N 0:1-21 comprises a region of unique amino add sequence that specifically identifies 
SEQ ID NO.1-21. For example, a fragment of SEQ ID NO: 1-21 is useful as an immunogenic peptide 
fox the development of antibodies dm specifically recognize SEQ ID NO: 1-21. The precise length of 
a fragment of SEQ ID NOl-21 and rhe region of SEQ ID NOtl-21 to which the fragment 
corresponds are routinely determinable by one of ordinary skin in the art based on the intended 
purpose for the fragment. 

A "full length** polynucleotide sequence is one containing at least a translation initiation codec 
(eg., methionine) followed by an open reading frame and a translation termination codon. A "mil 
length" polynucleotide sequence encodes a "full length" porypeptide sequence. 

"Homology" refers to sequence similarity or. interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide se qu ences. 

The terms "percent identity" and "% idertity ,** as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
ttandardbed algorithm. Such an algorithm may insert, in a sta n da r dized and reproducible way. gaps m 
the sequences being compared in a 



s may bed 

parameters of the CLUSTAL V algorithm as Incorporated into the MEO ALIGN version 3. 12e sequence 
alignment program This program is part of the LASER GENE software package, a suite of molecular 
biological analysis programs (DNASTAR. Madison WT). CLUSTAL V is described in Hlggirts, D.C. 
and P.M. Sharp (1989) CABIOS 3:131-133 and in Higgtos. D.O. et a!. (1992) CABIOS 8:189-191. 
For pafrwise ali gnments of polynucleotide sequences, the default parameters are set as follows: 
KtupJe=2, gap penany=3, windows*, and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default. Percent Identity is reported by CLUSTAL V as the "percent similarity" between 



Alternatively, n suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
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Tool (BLAST) (Ahxchol, S.F. et aL (1990) J. Mol. Btot 215:403-410). which is available from a 
sources, inducing the NCBL Bethesda, MD, and on the Internet at 

WipV/www.rcbi.nlm. nib gov/B LAST/. The BLAST software suite includes various sequence analysis 
programs including "bUstn." that is used to align a known polynucleotide sequence with other 
3 polyraxJeotide s e qu e n ces from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two rmdrfjhfr tf q wg KP "BLAST 2 
Sequences" can be accessed and used interactively at hitp/tew.actt.nbTLa^^/g<xVblTHjrt. The 
"BLAST 2 Sequences" tool can be used for both blastn and brastp (discussed below). BLAST 
progrwrB are ccenmonly used with gap and other parameters set to default settings. For example, to 
10 compare two nucleotide sequences, one may use Uastn wfch the "BLAST 2 Sequences" md Venice 
2-0-12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 
Matrix BLOSUM62 
Reward for match: J 

u -2 



1 3 Open Cap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 
Expect: 10 
Word Size: U 
Filter: on 

20 Percent identity may be measured over the length of an entire defined sequence, for example, as 

defined by • particular SEQ ID number, or may be measured over a shorter length, for example, over 
the length of a fragment taken from a larger, defined sequence, for Instance, a fragment of at least 20, at 
least 30. at least 40. at least 30. at least 70, at least 100, or at least 200 contiguous -wt™.^ Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 

23 shown herdn. In the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

Nucleic acid sequencer that do not show a high rtegrce of identity may ijeverthdess encode 
similar amino add secpences due to the degeneracy of the genetic code It is urjderstood that changes in 
a nucleic add sequence can be made using this degeneracy to produce multiple nucleic ar id sequences 

30 that all encode substantially the same protdn. 

The phrases "percent identity" and "% identity,'' as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
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explained in more detail above, generally preserve the charge and^ydrophobidty at the Site Of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as Incorporated into the MEG ALIGN version 3. 12c sequence alignment 
program (described and rdcrenced above). Far pairwise alignments of pdypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Kruple= 1 , gap penally** 3, window=3. and 
"diagonals saved"<*3. The PAM230 matrix it selected as the default residue weJght table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
between aligned polypeptide sequence pairs. 

, Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide se qu ence s, one may use the "BLAST 2 Sequences" tool Version 2.0.12 
(April-2 1-2000) with blastp set at default parameters. Such default parameters may be, for example: 
Afcrrixr BLOSUM62 

Open Gap: 11 and Extension Gap: I penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filler: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for Instance, 
a fragment of at least 13, at leas 20, at least 30, at least 40. at least 30, at least 70 or at least 130 
cortiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the s equences shown herdn. In the tables, figures or Sequence Listing, may be used to 



H (HACs)a 

DNA sequences of about 6 kb to 10 Mb In size and which contain all of the dements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more dosdy 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a pdynucleotide strand anneals with a 
complementary strand through base pairing under defined frypridlzation conditions. Sped fie 
hybridization is an Indication that two nucleic add sequences share • high degree of complement arity. 
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after the "washing" step(s). The washing step(s) Is particularly important in determining the stringency 
of the Irybridization process, with more stringent conditions allowing less non-specific binding. I.e., 
binding between pairs of nuddc add strands that are not perfectly matched. Permissive conditions for 
annealing of nuddc add sequences are routinely determinable by ore of ordinary skill in the en and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization sped fid ry. Pertnissive 
annealing conditions occur, for example, at 68 *C in the presence of about 6 x SSC, about 1% (w/v) 
SDS, and about 100 ug/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Sodi wash temperatures are typically selected to be about 
5*C to 2CC lower than the thermal melting point (TJ for the specific sequence at a defined kxdc 
strength and pH. The T m is the temperature (under defined ionic strength and pli) at which 30% of the 
target sequence hybridizes to a perfectly matched probe, An equation for calculating T, and conditions 
for nucleic add hybridization are wefl known and can be found in Sambrook, J.etal. (19891 Molecular 
Cloning A La boratory Manual. 2* ed. vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for Irybridization between pdynucleotides of the present Invention 
include wash condJiioru of 68*C in the presence of about 0.2 x SSC and about 0.1 * SDS, for 1 hour. 
Alternatively, temperatures of about 65 *C, 60°C, 53"C or 42"C may be used. SSC corxxraration may 
be varied from about 0.1 to 2 a SSC. with SDS bdng present at about 0.1%. Typically, blocking 
reagena are used to block rx>r>-«pedfic hybridization. Such blockirig reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 1 00-200 ug/ml. Organic solvent, such as 
farmanude at a concentration of about 33-30% v/v, may also be used under particular droimstances, 
such at for RNA:DNA hybridizations. Useful variations on these wash cooc&tiorn will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic add 
sequences by virtue of the formation of rrydrogen bonds between cornplementary bases. A hybridization 
complex may be formed in solution (eg.. or fV analysis) or formed between one nuddc add 
sequence present in solution and another nuddc add sequence imrmbfilxed on a solid support (eg., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which ceQs 
or their nuddc acids have been fixed). 
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disorders, or inTecUous or senetic <Ssease, etc. These conditions can be characterized by expression of 
5 various factor*, eg., cytokines, chemoUne*. and other signaling molecules, which may affect cdhilar 



An "Immunogenic fragment" b a polypeptide or oligopeptide fragment of GCREC which Is 
capable of did ling an tannine response when introduced Lao a Irving organism, for example, a 
mammal The term -Immunogenic fragment - also includes any polypeptide or oligopeptide fragment of 
10 GCREC ■which b useful in sny of the antibody production methods disclosed herein or known in the art. 

The term •"rdooanay" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrain. 

The terms "element" and "array dement" refer to a poSynudcoUde, polypeptide, or other 

13 The term "modulate" refers to ■ change in the acdviry of GCREC. For gamete, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 



The phrases "nucleic add" and "nucleic add sequence" refer to a nucleotide, oogonucleoUte, 
t thereof. These phrases also refer to DNA or RNA of genomic or 

oubto-stranded and may represent the sense or the 
e strand, to peptide nucleic add (PNA). or to any DNA-like or RNA-like material. 
" Opera Wy linked" refers to the situation in which a first nuddc add sequence is placed in a 
functional relationship with > second nucleic add sequence, For Instance, a promoter Is operabiy 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
25 sequence. Operabiy linked DNA sequences may be in dose proximity or contiguous and, where 
necessary to Join two protein coding regions. In the same reading frame, 

"Peptide nuddc add" (PNA) refers to an antlsense molecule or anti-gene agent which 
comprises an cttgcoKfecddecfu 

amino add residues ending in lysine The terminal lysine confers solubility to the composition. PNAs 
30 preferentially bind complementary single stranded DNA or RNA and stop transcript dongation, and 
may be pegylated to extend their lifespan in the cdl. 

"Post-traralattonal nxxfiScatiocr of an GCREC may involve lipktatkm, gh/cosylation. 
phosphorylation, acetylation, raceroization, proteolytic deavagc, and other modifications known in the 
an. These processes may occur synthetically or biochemically. Btocherntcal m odi fi ca t ions wfl] vary by 




Typical 



short nuddc adds, usually DNA oligonucleotides, which may be a nn ea l ed to a target polynucleotide by 
co mptonrrtsT y base-pairing, The prims' may then be extended along the target DNA strand by a DNA 
polymerase enzyme Prima pairs can be used for amplification (and identification) of a nucleic add 
sequence, eg, by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance spedfidty, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20. 25. 30, 40. 50, 6ft 70, 80, 90. 100, 
or at least 150 consecutive nucleotides of the dtedosed mKJcsc add sequences. Probes and primers may 
be considerably longer than these examples, and his understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described In the references, for 
example Sambrook, ). d al. (1989) Molecular Cfornmr A Laboratory Manual. 2- ed.. vol. 1-3. Cold 
Snring Harbor Press. Ptahrdew NYTAusnbd. RM. et al ri98Tl Current Pro, cents In Molecular 
Btotogy. Greene PuN. Assoc A Wney-Intersderces. New York NY; Innis, M. et al (1990) PCR 
Protocols, A Guide to Methods and Applications. Academic Press. San Diego CA. PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5. 1991. Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are sdected using software known in the art for such 
purpose. For example, OL1GO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucle o tides each, and for the analysis of oligonucleotides and larger poryindeotides of up to 5.000 

programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical C ent er, Dallas TX) is c a p a b le of choosing specific primers from mega base sequences 
and is thus useful for designing primers on a genome- wide scope. The Primer 3 primer selection 
program (available to the public C 

Cambridge MA) allows the user to input a "nrisprinriag library,'* in v 
binding sites are user-sped fieri. Primer 3 is useful, in particular, for the selection of d 
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nwCroarrays. (The source code for the tatter two primer selection programs may also be obtained from 
their respe cti ve sources and modified to meet the user's sped fie needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 



Hence, this program is useful for identification of both unique and conserved oUgonudeot irirs and 
polynucleotide fragments. The oligonuc leo tides and polynucleotide fragments identified by any of the 
above selection methods are useful to bybrkfization technologies, for example, as PCR or sequendng 
primers, uiiciooiiay dements, or spedfic probes to identify fully or partially complementary 
10 polynucleotides in a sample of nuddc adds. Methods of oligonucleotide selection are not limited to 



A "recombinant nuddc add** b a sequence that is not naturally occurring or has a sequence 
that Is made by an artificial combination of two or more otherwise separated segm e nts of sequence. 
This artificial combination is often accomplished by c hem ica l synthesis or, more commonly, by the 

IS artificial manipulation of Isolated segments of nu cleic a dd s , eg., by genetic en gin eer ing techniques 
such as those described in Sambrook, supra . The term recombinant ii alu des nuddc adds that have 
been altered solely by addition, substitution, or deletion of a portion of the nu ddc add. Frequently, a 
recombinant nuddc add may include a nud d c add sequence operabiy linked to a pro m ot er s eque nc e. 
Such a recombinant nuddc add may be part of a vector that is used, for example, to transform a cell. 

20 Alternatively, such recombinant uodeic adds may be part of a viral vector. e.g., based on ■ 

vaccinia vfrus. that could be use to vaccinate a mammal wherein (he recombinant nuddc add is 



A "regulatory dement** refers to a raxtdc add sequence usually derived from untranslated 
regions of i gene and includes enhancers, promoters, imrons, and 5* and 3* untranslated regions (UTRs). 
Regulatory dements interact with host or viral protdns which control transcription, translation, or RNA 
stability. 



amino add, or antibody. Reporter n 



An "RNA equivalent,'' In reference to a DNA sequence. Is composed of the *■ 
sequence of nucleotides as the reference DNA sequence with the exception thai all o 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 



The term "sample" is used in its broadest sense. A sample suspected of containing GCREC. 
nucleic adds encoding GCREC, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organdie, or membrane isolated from a cdl; a cell; genomic DNA, RNA, or cDNA 
in solution or bound to a substrate; a tissue; a tissue print; etc 

The terms "spedfic binding" and "spedfically binding" reft 
protein or peptide and an agonist, an antibody, an antagonist, a small m o tecutc , or any n 



of the prutdn, eg., the antigenic detaiulnant or epitope. tfi 'O gn W ed by the binding molecule. For 
example, if an antibody is spedfic for epitope "A" the presence of a polypeptide comprising the epitope 
10 A, or the presence of free unlabdcd A In a reaction containing free labeled A and the antibody win 
reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic add or amino add seq u en ce s that are 
removed from thdr natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75* free, and most preferably al least 90% free from other components with which 
15 they are naturally associated. 

A "substitution" refers to the replacement of one or more amino add residues or nucleotides by 
i amino add r esi d ues or nucleotides, respectively. 
"Substrate" refers to any suitable rigid or semi-rigid support including n 



20 mkroparticles and capillaries. The substrate can have a variety of surface forms, such as wells, • 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound 

A "transcript image" refers to the collective pattern of gene expression by a particular ecu type 
or tissue under given conditions at a given time. 

Transformation" describes a process by which exogenous DNA is Introduced into a redplent 

25 cdl. Transformation may occur under natural or artificial conditions according to various methods wen 
known in the art. and may rdy on any known methed for the insertion of foreign nuddc add sequences 
into a proiaryotic or eukaryotic host cell. The method for transformation b selected based on the type 
of host cdl bdng transformed and may include, but is not limited to, bacteriophage or viral Infection, 
clectraporation, heat shock, lipofection, and partidc bcanbardment. The term "Transformed cdts" 

30 includes stably transformed ceUs in which the Inserted DNA is capable of replication chher as an 

autonomously replicating ptasnrud or as part of the host chromosome, as wet! as transiently transformed 
cdfct which express the Inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any org anism , indudtog but not limited to 
animals and plants. In which one or more of the ceDs of the organism contains heterologous nuddc 
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acid inorjduced by way of human inccrrention, such as by transgcninHKques wen known in the 
an. The nudac add is introduced Into the ceil, directly or indirectly by introduction into • precursor 
of (be eel), by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria. fungi, plants and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the an. for example inn 
transformation or traitsconjugatico. Techniques for transferring tbe DNA o 
into such organisms are widely known and provided In references such as Sarobrook et al. (1989X 



s a nucleic add sequence having at 
least 40* sequence identity to the particular nucleic add sequence over a certain length of one of die 
nuddc add sequences using btastn with the "BLAST 2 Sequences" tool Version 10.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic adds may show, for example, at least 50*, at least 
60*. at least 70*. at least 80%, at lean 85*. at lean 90*. at least 93* or at lean 98* or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an "6116116" 
(as defined above), "splice," "sp e ci e s," or "polymorphic'' variant. A splice variant may have t ig mf ira nt 
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splidng of exons during mRNA processing. The corresponding polypeptide may 

Spedes variants are peJynudeotide seque nce s that vary from one spades to another. The resulting 

wrafly have significant amino add identity relative to each ether. A polymorphic 



variant is a variation in tbe polynucleotide sequence of a particular gene between Individuals of a given 
spedes. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the pdy nucleotide sequ enc e varies by one nucleotide base. The presence of SNPs may be 
indicative of, for e^'mplf, a certain population, a disease state, or a p r o pen si ty for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 
lean 40* sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using biastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
sd at default parameters. Such a pair of polypeptides may show, for example, at lean 50%, at lean 
60*. at lean 70%. at lean 80%. at lean 90*. at lean 95*. or at lean 98* or greater sequence 
identity over a certain defined length of one of the polypeptides. 



WO 01/570*5 



PCT/IJS01/014S5 



THE INVENTION 

The invention is based on the discovery of cew human Gprotein coupled receptors (GCREC), 
the polynucleotides encoding GCREC, and the use of these compositions for the diagnosis, treatment, or 
prevention of ceD proliferative, neurological, cardiovascular, gastrointestinal, 
autounmune/mflainmatory, and metabolic disorders, and viral infections. 

Table 1 summarizes the nomenclature for die full length polynucleotide and polypeptide 
sequences of the Invention. Each porynucleotide and Us corresponding polypeptide are correlated to a 
single Incyte project Identification number (Incyte Project ID). Each polypeptide sequence is denoted 
by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte 
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each porynudeotide sequence fa 
denoted by both ■ pofynudeotide sequence identification lumber (Polynucleotide SEQ ID NO:) and an 
Incyte poly n u cl eotide consensus sequence number (incyte Pdynucleotide ID) as shown. 

Table 2 shows sequences with homology to tbe polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (genpept) database. Coturons 1 and 2 show the 
polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte 
a* (Incyte Polypeptide ID) for polypeptides of the inve n tio n . Column 3 
t (Genbank ID NO:) of the nearest GenBank hcenoiog. 
Column 4 shows the probability aeore for the match between each polypeptide and Its GenBank 
homdog. ColumnS Shows the annotation of the GenBank homdog along with relevant citations where 
applicable, all of winch are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns I and 2 
show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte 

e number (Incyte Polypeptide ID) fix each polypeptide of the invention. Column 3 
t of amino add residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosyiafion rites, as determined by the MOTIFS 
program of the GCO sequence analysis software package (Genetics Computer Group, Madison WT). 



shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods were applied. 

As shown in Table 4, the full length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 
combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence number (Incyte Pdynucleotide ID) for each pdynucleotide of the invention. 
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Column 3 shows the length of each pdynucleotide sequence in basepairs. Column 4 list* fragments of 
tbe pdynucleotide sequences which are useful, for example, in hybridization or amplification 
technologies that identify SEQ ID NO: 22 -42 or that distinguish between SEQ ID NO:22-42 and 
related polynucleot i de sequences. Column 5 shows Identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages 
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
pdynndrrjitde sequences of the invention. Coiamns 6 and 7 of Table 4 show the nucleotide start (5*) 
and nop (30 positions of the cDNA and genomic sequences in column 5 relative to their respective fun 



s in Column 5 of Table 4 may refer specifically, for example, to 
Incyte cDNAs along with their corresponding cDNA libraries. For example, 5080262H 1 is the 
Identincation number of an Incyte cDNA sequence, and LNODNOT1 1 Is the cDNA library from which 
it is derived. Incyte cDNAs for which cDN A libraries are not Indicated were derived from pooled 
cDNA libraries (eg. SBSA02572V1). Alternatively, the Identification numbers in column 5 may refer 
to GenBank cDNAs or ESTs (eg., g4589483_CD) which contributed to the assembly of the full length 
pdynu decode sequences. Alternatively, the identification numbers in column 5 may refer to coding 
regions predicted by Genscan analysis of genomic DNA. For example, GhrN.g5902227_030.edit is the 
identification number of a Genscanpredicted coding se quence , with £5902227 being the GenBank 
identification number of the sequence to which Genscan was applied. The Genscan-precbcted coding 
sequences may have been edited prior to assembly. (See Example IV.) Altcrnativdy. the identification 
numbers In column 5 may refer to assemblages of both cDN A and Gcascanpredided exons brought 
together by an "exon stitching" algorithm. (See Example V.) Alternatively, the Identification numbers 
in column 5 may refer to assemblages of both cDNA and Genscan-predicted exons brought together by 
an "ex on- stretching" algorithm. (See Example V.) In si 



Table 5 shows the representative cDNA libraries for those full length polynucleotide sequences 
which were assembled using Incyte cDNA sequences. The representative cDN A library is the Incyte 
cDNA library which is most frequently represented by the Incyte cDNA sequences which were used to 
assemble and confirm the above pdynucleotide sequences. The tissues and vectors which were used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses GCREC variants. A preferred GCREC variant is one winch 
has at lean about 80*. or alternatively at lean about 90*. or even at lean about 95* amino add 
sequence identity to the GCREC amino add sequence, and whkh comains al least one functional or 



WO 01/57085 



PCT/US01/03455 



structural characteristic of GCREC. 



s polynucleotides which encode GCREC. In a particular 



the group consisting of SEQ ID N 0:22-42. which encodes GCREC. The polynucleotid e sequence- of 
SEQ ID NO:22-42, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uradl, and the sugar backbone 
is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding GCREC. in 
particular, such a variant porynucleotide sequence will have at least about 70*. or altemativdy at least 
about 85*. or even at least about 95* polynucleotide sequence identity to the polynucleotide sequence 
encoding GCREC. A particular aspect of the invention encompasses a variant of a potynodeotide 
sequence comprising a sequence selected from the group consisting of SEQ ID N022-42 which has at 
least about 70*. or alternatively at least about 85%. or even al least about 95* pdynudectide 
sequence identity to a nuddc acid sequence selected from the group consisting of SEQ ID N022-42. 
Any one of the polynucleotide variants described above can encode an amino add sequence wmoh 
contains at lean one functional or rtructural characteristic of GCREC. 

It will be appreciated by those skilled in the art thai as a result cS the degeneracy of trie genetic 
code, a multitude of pdynucleotide sequences encoding GCREC, some bearing minimal similarity to 
the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 
invention contemplates each and every possible variation of pdynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as ap p li ed to the porynucleotide s e qu ence of naturally 
occurring GCREC, and all such variations are to be considered as bang specifically disclosed. 

s which encode GCREC and its variants arc generally capable of 




GCREC or 



occurring cottons. Cottons may be selected to increase the rate at which expression of the peptide 



30 particular codons are utilized by the host Other reasons for substantially altering the nucleotide 
seq u ence encoding GCREC and Its derivatives without altering the encoded amino add seq u ences 
Include the production of RNA transcrtpo having more desirable 
than transcripts produced from the naturally occurring sequence. 
The invention also encompasses production of DNA 
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CCREC derivatives, or fngmcns thereof, entirely by ryntheUc 



Also encompassed by the invention are poly n ud eo tide sequences thu are capable of 
hybridizing to the claimed porynudeotide sequences, and. in particular, to those shown in SEQ ID 
NO: 22-42 and fragments thereof under various co rx Bii rag of stringency. (See, eg.. Want. CM. and 
S.l_ Berter (1987) Methods Enzymd. 152:399407; Kimmd, A.R. (1987) Methods Enzyme*. 
152:507-51 1.) Hybridization con&tions. Including annealing and wash awdtkms, arc described In 



Methods for DNA sequencing are well known In the an and may be used to practice any of the 
enibodlinentsofthelnvtiaion. The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical. Cleveland OH). Taq polymerase (Applied 
BkxystemsX thermosubfc T7 polymerase ( Amersham Pharmacia Biotech, Ptecataway NJ). or 
combinations of polymerases and proofreading exonudcases such as those found In the ELONGASE 
amplification system (Life Technologies, Grilhersburg MD). Preferably, sequence preparation Is 
automated wfth machines such as the MICROLAB 2200 liquid transfer system (Hamilton. Reno NV), 
PTC200 thermal cycler (MJ Research, Watertown MA) and ABi CATALYST &00 thermal cyder 
(Applied Btasysteras). Sequencing is then carried out using dlher the ABI 373 or 377 DNA sequencing 
system (Applied Bksystems). the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, 
Sunnyvale CA), or other systems known in the art. The resulting s equen ce s are analyzed using a 
variety of algorithms which are wrfl known in the art. fSee. eg.. AnsnbeL F.M. (19971 Short Protocols 
m Mntecnlar Biology. John Wiley & Sons. New York NY, unit 7.7; Meyers, R.A. (19951 Molecular 
Biology and Btatechnologv. WDey VCH. New York NY, pp. 836-853.) 

The nuddc acid sequences encoding CCREC may be extended utilizing a partial nucleotide 
sequence arid employing various PCR -based methods known in the art to detect upstream sequences, 
such as promoters and regulatory dements. For example, one method which may be emptoyeri, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See. eg.. Sarkar. G. (1993) PCR Methods AppUc. 2:318-322.) 



sequence from a circularized te mpl a t e, The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See. eg.. Triglia, T. etaL (1988) Nuddc Adds 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 

e DNA. (See. eg., Lagerstrom, M. et al. 
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(1991) PCR Methods AppUc. 1:111-119.) In this method, 
ligations any be used to Insert u engineered dou b l e -strand 



3 wtrirti may be used cc 

are known In the art. (See, eg.. Parker. J.D. et aL (1991) Nucleic Adds Res. 19J035-3060). 
Additionally, one nay use PCR, nested primers, and PROMOTERHNDER libraries (Clontoch, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and Is useful in 
finding intron/cxem junctions. For an PCR -based methods, primers may be designed using 
commercially available software, such as OL1GO 4.06 primer analysis software (National Bio sc iences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
GC content of about 50* or more, and to anneal to the template at temperatures of about 68*C to 
72-C. 

When screening for full length cDN As, It is preferable to use libraries that have been 
d to include larger cDN As. In addition, random- primed libraries, which often Include 

c 5' regions of genes, are preferable for situations In which an oligo d(T) library 
IS does not yield a full-length cDNA. Genomic libraries may be useful for exienstan erf sequence irao 5' 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be u 
size or confirm the nucleotide se o jiifhT e of s e q uen cing or PCR products, in parti cu la r , capillary 



specific, laser -stimnlatcd fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light Intensity may be converted to electrical signal using appropriate 
software (eg.. GENOTYPER and SEQUENCE NAVIGATOR. Applied BiosystemsX and the entire 
process frcoi loading of samples to computer analysis and etectronic data display may be compotrr 
controlled. Capmary de uio phoie sis Is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another e m bodi m ent of the invention, pol yn ucleotide sequences or fragments thereof which 
a DNA molecules that direct expression of CCREC, or 
reof, In appropriate host cells. Due to the inherent degeneracy of 



the genetic code, other DNA ti 
amino add sequence may be produced and used to express GCREC. 

The nucleotide s eq ue n ces of the present invention can be engineered using methods generally 
known in the art in order to alter GCREC-encocfing sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression erf the gene product. DNA 
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oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
medlated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

Tile nudeotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARB REEDING (Maxygen Inc. Santa Clara CA; described in U.S. Patent Number 
3,837.458; Chang, C-C et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotochnol. 17:259-264; and Crameri. A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
Improve the biological properties of GCREC, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants Is produ c ed using PCR-mediated recombination of geno fragments. The library Is then 
subjected to se le ction or screeni ng procedures that identify those gene variants with the desired 
properties. These preferred variants may (hen be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through 'artifidal" 
breeding and rapid molecular evolution. For example, frag ment s of a single gene co nt a in i n g random 
point mutations may be rttorabincd, screened, and (hen reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recom b loed with fragments of 
homologous genes in the same gene Tamily, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 



s encoding GCREC may be synthesized, in whole or in part, '• 
using chemical methods well known in the art. (See, eg., Caruthers. M.H. et al. (1980) Nucleic Adds 
Symp. Ser. 7:213-223; and Horn, T. et al. (1980) Nucleic Adds Symp. Ser. 7:225-232.) Alternatively. 
GCREC iUetf or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, eg., 
CrcLgfaton. T. M 9841 Proteins. Structures and Molecular Pronerties. WH Freeman. New York NY. no. 
55-60; and Roberge, J.Y. et aL (1995) Science 269:202-204.) Automated synthesis may be achieved 
using die ABI 43 1 A peptide synthesizer (Applied Btosystems). Additionally, the amino add sequence 
of GCREC, or any part thereof, may be altered ^"t*ig direct synthesis and/or combined with sequences 
from otiicr proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a na rurally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, eg. CMez, R M. and F.ZL Regraer (1990) Methods Enzymd. 1 82:392-421 .) 
The composition of Ihe syntactic peptides may be confirmed by amino add analysis or by sequencing. 
(See, eg., Crrighton, sunjj, pp. 28-53.) 



or derivatives thereof may be inserted into an a; 



sequence in a suitable host. These elements Include regulatory sequ enc es, such as enhancers, 
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and In 

s encoding GCREC. Such elements may vary in then- strength and spedQdty. 
Is may also be used to achieve more f f fkartf translation of sequences e n coding 
GCREC Such signals include the ATG Initiation codon and adjacent sequences, eg. the Kozak 
sequence. In cases where sequences encoding GCREC and its initiation codon and upstream regulatory 
sequences are inserted into the ap prop ri ate e xp re s sion vector, no additional transcriptional or 
translational coand signals may be n ee d ed . However, in cases where only coding sequence, or a 
fragment thereof, b inserted, exogenous era re l at ional control signals Inducing an in-frame ATG 
n should be provided by the vector. Exogenous translations! elements and tartiaticra 
, both natural and synthetic The efficiency of expression may be 
enhanced by the Inclusion of enhancers appropriate for the particular host ceH system used. (See, eg., 
Schaxf. D. et aL (1994) Results Probl. CeD Differ. 20:125-162.) 

Methods which are well known to those sldlled in the an may be used to construct expression 
3 encoding GCREC and appropriate transcriptional and translational 
. These methods include In vitro recombinant DNA Uxhnkjues, synthetic techniques, 
and to vivo genetic recombination. (See, eg.. Sambrook, J. et al. (1989) Mqlecqjar Oprine. A 
laboratory Manual . Cdd Spring Harbor Press, Plain view NY. eh. 4, 8, and 16-17; Ausuhd, F.M. et 
al. fl99Tt Current Protocol* in Molecular Biology. John WDey & Sons. New York NY. cb. 9. 13. and 
16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding GCREC. These indude, but are not limited to, microorganisms such as bacteria uansfbrcoed 
si bacteriophage, pi as (rid, or cosmM DNA expression vectors; yeast transformed with 
eO systems infected with viral expression vectors (eg., baculovfrus); 
plant ceD systems transformed with viral expression vectors (eg., cauliflower mosaic virus, CaMV. or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (eg.. Tl or pBR322 piasmids); or 
animal ceD systems. (See, e g., Sambrook, supra: AusubeL ru pra : Van Heefce, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et tl. (1994) Proc Natl. Acad. Sd. USA 
91:3224-3227; SandJg. V. et aL (1996) Hum, GeneTher. 7:1937-1945; Tikaroatsu. N. (1987) EMBO 
1. 6:307-31 1 ; The McOpw HjJJ YWTt>PP*. of Scjence and Trdmc*0tv(1992) McGrawHfll. New 
York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc Natl. Acad, Sd. USA 81:3655-3659; and 
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Harrington, J J. et al. (1997) Nat Genet 15:345-355.) Exprcssion^^^erived from n 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide se q ue n ces to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola. M et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M et at. (1993) Proc Nail. Acad. Set 
USA 9O(13):6340-6344; Boiler, R.M. et aL (1985) Nature 317(604O):813-815; McGregor. D P. et al. 
(1994) MoL Immunol. 31(3):219-226; and Vcrma. IM. and N. Somia ( 1997) Nature 389:239-242.) 
d by (he host ceil employed. 

x of cloning and expression vectors may be s ele ct ed depending 
upon the use intended for polynucleotide sequences encoding CCREC. For example, routine ctoning. 
sobcloaing, and propagation of pody nucleotide sequences encoding GCREC can be achieved using a 
multifunctional E^cpJi vector such as PBLUESCRBT (Stratagem. La JoBa CA) or PSPORT1 piasmid 
(Ufe Technologies). Ligation of sequences encoding GCREC into the vector's multiple cloning site 
disrupts (he ladZ. gene, allowing a coJorimetric screening procedure for Identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for fo ynr p 
transcription, tadeoxy sequencing, single strand rescue with helper phage, and creation of nested 
deletions in the clonal sequence. (See, eg., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of GCREC are needed, eg. for the production of antibodies, 
. vectors which direct high level expression of GCREC may be used. For example, vectors containing 
the strong. Inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of CCREC. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast S_ 



of foreign sequences into the host genome for stable propagation. (See, eg., AusubeJ, 1995. supra : 
Bltter,G.A. et al. (1987) Methods Enzyme*. 153516:544; and Scorer. CA. et al. (1994) 
Bk^echnotogy 12:181-1 B4.) 

Plant systems may also be used for expression of GCREC Transcription of sequences 
encoding GCREC may be driven by viral promoters, eg., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamaisu, N. (1987) EMBO J. 
6307-31 IX Alternatively, plant promoters such as the small summit of RUB IS CO or heat shock 
p r o m o t er s may be used. (See. eg,, Coruzzi, G. et al. (1984) EMBO J. 3:1 671-1680; Brogue, R. et al. 
(1984) Science 224*38-843; and Winter, J. et al. (1991) Results ProW. Cell Differ. 17:85-105.) These 
constructs can be Introduced into plant cells by direct DNA Oamformation or pathogen- media ted 
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New York NY. pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding GCREC may be Ugated Into 
an adenovirus transcription/hanstatian complex consisting of the late promoter and tripartite leader 
sequence Insertion in a non-essential El or E3 region of die vrxal gerwrne may be used to obtain 
infective virus which expresses GCREC in host certs. (See, eg., Logan. J. and T. Shenk (1984) Proc 
Nad. Acad. Set. USA 81 .3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RS V) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial ctsornosomes (H AO) may also be employed to deliver larger fragments of 
DNA than c 




or vesicles) for therapeutic purposes. (See, eg., Harrington, J J. et al (1997) Nat. Genet 15345-355.) 

For long term production of recombinant proteins in n 
GCREC in cell lines is preferred. For example, sequences encoding GCREC c 
cell lines using expression vectors which may contain viral origins o 
expression dements and a selectable marker gene on the same or on a separate v 
introduction of the vector, ceils may be allowed to grow for about 1 to 2 days In enriched rr 
being switched to selective media. The purpose of the selectable marker Is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cen type. 

Any number of selection systems may be used to recover transformed cefl lines. These include, 
but are not limited to. the herpes simplex virus thymidine kinase and Bdcnine pbosphoribosyttransferase 
genes, for use in if and opr* cells, respectively. (See. eg. Wigler, M et al. (1977) Cell 1 1 : 223-232; 
Lowy, L et al. (1980) Cell 22:817-823.) Also, antimetabolite, antmtotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; tuo confers 
resistance to the anunoglycoaictes nccwrycin and G-4 18; and air and par confer resistance (o . 
chlorsulfuron and phosr^rwtricin acetyltransferase, respectively. (See, eg_ Wigler. M. et al. (1980) 
Proc. Natl. Acad. Sci. USA 773567-3570; CoTbere-Garapin. F. et aL (1981) J. Mol. BioL 150:1-14.) 
Additional selectable genes have been described, eg., trpB and hisD, which alter cellular requirements 
for metabolites. (See, eg., Hartman, S C. and RC. Mulligan (1988) Proc. Natl. Acad Sci. USA 
85:8047-805 1.) Visible markers, eg., anthocyanlns, green fluorescent proteins (GFP; aontech). B 
3r ludferase and its substrate todferin a 
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n be used not. only to identify (ransformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, eg., Rhodes. CA. 
(1995) Methods Mol. Biol. 55:121-131.) . 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding GCREC is inserted within a marker gene sequence, transformed cells containing 
se q ue nc es encoding GCREC can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding GCREC under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 



In general, host cells that contain the nucleic acid u 
GCREC may he identified by a variety of procedures known to those of skill In the art These 
procedures include, but are not limited to. DNA -DNA or DNA-RNA bybriCttzauans, PCR 
amplification, and protein Moassay or Immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection ancVor Quantification of nucleic acid or protein sequences. 

Immunologic*] methods for detecting and measuring the expression of GCREC using cither 
specific polyclonal cr racrtodorul antibodies are known In the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (EUSAs), radioimmunoassays (RlAs), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal -based immunoassay utilizing monoclonal 
antibodies reactive to two non- interfering epitopes on GCREC is preferred, but a competitive binding 
assay may be employed These and other assays are well known in the art. (See, eg!, Hampton, R. et 
«!• (1990) Serological Methods, a Laboratory Manual. APS Press. St. Paul MN, Sect. IV; Coligan. J.E 
et al. (1997) Current Prrxncofa In rmrminnloev. Greene Pub. Associates and WDey-lnterscience, New 
York NY; and Pound. J.D. (1998) IrntTMtxxhcrrrtcal Protocols. Humana Press. Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used m various nucleic add and amino acid assays. Means for producing labeled bybrkbzauon 
or PCR probes for detecting sequences related to polynucleotides encoding GCREC include 
dlgolabdlng. nick translation, end-labeling, or PCR amplification using a labeled nucleotide 
Alternatively, the sequences encoding GCREC, or any fragments thereof, may be doned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes h vitro by addition of an appropriate RNA pofyrnerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of 
cornnaerdally available kits, such as those provided by Amentum Pharmacia Biotech. Promega 
(Madison WI). and US Biochemical. Suitable reporter molecules or labels which may be used for ease 
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Host ceils transformed with nucleotide sequences encoding GCREC may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cefl may be secreted or retained intraceOularly depending on the sequence 
and/or the vector used As will be understood by those of skill in the art, e 



n vectors containing 

polynucleotides which encode GCREC may be designed to contain signal sequences which cfirect 
secretion of GCREC through a prokaryotic or eukaryotic ceO membrane, . 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
d to, acetylation, cartxnylation, glycosylation, phosphorylation, 
.tional processing which cleaves a "prepro" or "pro" form of the . 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-Oxnstational activities 
(eg.. CHO, HeLa, MDCK, HEK293. and W138) are available from the American Type Culture 
Collection (ATCC. Manassas VA) and may be chosen to ensure the correct modifkaiion and processing 
of the foreign protein 

In another embodiment of the Invention, natural, inodi fieri, or recombinant nucleic acid 
sequences encoding GCREC may be Ugated to a heterefcagcag sequence resulting in translation of a 
fusion protein In any of the aforementioned host systems. For example, a chimeric GCREC protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for Inhibitors of GCREC activity. Heterologous protein and 

e affinity 



binding protein (MBP), throredoxin (Trx), calmodulin binding peptide (CBP). 6-His, FLAG, c-myc. and 
Ijemagglutinin (HA). GST. MBP, Trx. CBP. and 6-His ens 



respectively. FLAG, c-myc, and hemagglutinin (HA) em 
proteins using commercially available monoclonal and poryctonal a 
these epitope tags. A fusion protein may also be enginee 
located between the GCREC encoding sequence and the heterologous protein sequence, so that GCREC 
may he cleaved away from the heterologous moiety foftowing purifKation. Methods for fusion protein 
expression and purification are discussed in AusubeJ (1995. supra, ch 10). A variety of commercially 
available kits may also be used to facilitate expression and purification of fusion proteins. 
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In a further embodiment of the insertion, synthesis of racSolabek^^^CREC nay be aexscved ji) 
vrtrp using the TNT rabbit reticulocyte fysate or wheat germ anna system (Prccnega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7. T3, Of 
SP6 promoters, Translation takes place to the pres en ce of s radiolabeled amino add precursor, tor 
5 a ample, "S-roethio n i rr . 

GCREC of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to CCREC. At least one and up to a plurality of test compounds may be 
screened for specific binding to GCREC. Examples of test compounds include antibodies, 
digonucleotfdes, proteins (eg, receptors), or small molecules. 

10 In one embodiment, the compound thus identified is closely related to the natural Ugand of 

GCREC, e.g , a Ugand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. iSee. c«_ CoUtan. JJL et 1. f 19911 Current Protocols i n Immunology 1(1): 
Chapter 5.) Similarly, Che compound can be dosely related to the natural receptor to which GCREC 
binds, or to at least a fragment of the receptor, eg, the Ugand binding she. In either case, the 

IS co m pound can be r at i on a ll y d esigned using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express GCREC, either as a secreted 
protein or on the cell membrane. Preferred celts include ceDs from mammals, yeast, p rpsophila. or £, 
coli. Cells expressing GCREC or ceil membrane fractions which contain GCREC are then contacted 
with a test compound and binding, stimulation, or inhibition of activity of either GCREC or the 

20 compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluoropbore, radioisotope, enzyme conjugate, or other detecta b le label For example, 
the assay may comprise the steps of combining at least one lest compound with GCREC, either in 
solution or affixed to a solid support, and detecting the bi n di n g of GCREC to the compound. 
) 23 Alternatively, the assay may detect or measure binding of a test compound in o« presence of a 

labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compoondXs) may be free in solution or affixed to a 
solid support. 

GCREC of the present invention or fragments thereof may be used to screen for compounds 
30 that modulate the activity of GCREC. Such compounds may include agonists, antagonism, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
GCREC activity, wherein GCREC is combined with at least one test compound, and the activity of 
GCREC in the presence of a test compound is compared with the activity of GCREC in the absence of 
the test compound. A change In the activity of GCREC In the presence of the test compound is 
35 indicative of a compound that modulates the activity of GCREC. Alternatively, a test compound is 

3d 
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e for GCREC 

activity, and the assay is performed. In either of these assays, a test compound which modulates the 
activity of GCREC may do so indirectly and need not come in direct contact with the test compound. 
At lea st one and up to a plurality of test compounds may be screened. 

5 In another embodiment, polynucleotides encoding GCREC or their mammalian hornctogs may 

be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such mrtwnq it rs are well known in the an and are useful for the generation of animal models of 
human disease. (See, eg. OS. Patent Number 5.175.383 and US. Patent Number 5,767,337.) For 
exampte, mouse ES ceils, such as the mouse 1 29/SvJ ceil line, are derived from the carry mouse embryo 

10 and grown in culture. TheES celh: are transformed with m vrrrry cOT^ ainingthr gene of Irttrest 

disrupted by a marker gene, eg., the neomycin phusphorransferase gene (neo, Capecchi, M.R. (1989) 
Science 244:1 284-1 292). The vector Integrates into the corresporeflng region of the host genome by 
tornologous recombination. Alternatively, homologous recombination takes place using the CnMoxP 
system to knockout a gene of interest In a tissue- or de^oprnental stage-specific manrex (Mirth, jj>. 

15 (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et at (1997) Nude* Acids Res. 25:4323-4330). 



the C57BL/6 mouse strain. The blastocysts are surgically Transferred to pxrudopregnant d 
resulting chimeric progeny are genofyped and bred to produce heterozygous or hornozygou 
Transgenic animals thus generated may be tested with potential therapeutic or toxic agercs. 
20 PdynockxXlOCs encoding GCREC may also be manipulated In vitro In ES ceOs derived from 

human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderin, rnesoderm, and ectodermal rxfl types. These cell Uncages difforentiaie 
into, for example; neural cells, hematopoietic lineages, and cardiornyocytes (Thomson, J.A a al. (1998) 
Science 282:1145-1147). 

25 Polynucleotides encoding GCREC can also be used to create Toiockin" humanized ardmats 

(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding GCREC is injected into animal ES ceils, and the injected sequence 



of a hurra 

il inbred to overexprcss GCREC. eg. by secreting GCREC in its milk, may also serve as a 
a source of that protein (Jaime, J. et al. (1998) Bioechnol. Anna. Rev. 435-74). 
THERAPEUTICS 

Chemical and structural similarity, eg. in the context of sequences i 
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between regions of GCREC ind G-prottin coupled receptors. Therefore, GCREC appears to play a 
role In ceU proliferative, neurological, cardiovascular, gastrointestinal, auldmmunc/infliimnatory. and 
metabolic disorders, and viral infections. In the treatment of disorders associated with increased 
GCREC expression or activity, it Is desirable to decrease the expression or activity of GCREC In the 
5 treatment of disorders associated with decreased GCREC expression or activity, it is desirable to 
increase the expression or activity of GCREC 

Therefore, in one embodiment GCREC or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of GCREC Examples of such disorders Include, but are not limited to, a cell proliferative 

10 disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, drrbosfs, hepatitis, mixed 
connective tissue disease (MCTD). myelofibrosis, paroxysmal n 
polycythemia vera, psoriasis, primary thrombocythemia. and a 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocordnoroa, and, in particular, cancers of 
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, gangtia, 

IS gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus: a neurological disorder such as 
epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease. Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 

20 retinitis pigmentosa, hereditary ataxias, multiple sderosis and other demydlnating dis ea ses, bacterial 
and viral meningitis, brain abscess, su bdural empyema, epidural a b scess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including fcuru, CreutzfcMt-Jakob disease, and Gerstmann-Stiai 




derrnatomyostti* and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
30 gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 
seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 

ir palsy, coni co basal degeneration, and familial frontotemporal dementia; a cardiovascular 
rteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's disease, 
35 aneurysms, arterial dissections, varicose veins, thrombophlebitis and pWebotiirombosis. vascular 



artery bypass graft surgery, congestive heart failure, ischemic h 
myocardial infarction, hypertensive heart disease, degeneratlvt 
valve stenosis, congeni tally bicuspid aortic valve, mitral a 
5 rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic 

endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, 
myocarditis, pericarditis, neoplastic heart dis ease , congenital heart disease, and complications of 
cardiac transplantation; a gastrointestinal disorder such as dysphagia, peptic esophagilis. esophageal 
spasm, esophageal stricture, esophageal card noma, dyspepsia, indigestion, gastritis, gastric 
10 carcinoma, anorexia, nausea, emesis, gastrop&rcsis, antral or pyloric ede ma , abdominal angin a. 



pyrosis, gastroenteritis, intestinal obstruction, infections of the li 



hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, infectious colitis, 
ulcerative colitis, ulcerative proctitis. Crohn's disease, Whipple's disease, MaTtory- Weiss syndrome, 
colonic card noma, colonic obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, 
constipation, gastrointestinal hemorrhage, acquired immonodefidency syndrome (AIDS) enteropathy, 
jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic s t eatos i s, hemochromatosis, 
Wilson's disease, alpha^antilrypsin deficiency. R eye's syndrome, primary sd erasing cholangitis, 
liver Infarction, portal vein obstruction and thrombosis, centri lobular necrosis, peliosis bepatia, 
hepatic vein thrombosis, veno-ocdusive disease, preeclampsia, eclampsia, acute fatty liver of 

odularhypej 




25. hemolytic anemia, 

dystrophy (APECED), b 



X dermatitis, Crohn's disease, atopic dermatitis, 
ic rymphopeda with lyrnphocytrxcoirts. 



erythroblastosis fetalis, erythema nodos u m, atrophic gastritis, glomerulonephritis. Goodpasture's 
syndrome, gout. Graves' disease, Hashimoto's thyroiditis, hypereodixxphula. irritable bowd 

30 syndrome, multiple sderosis, myasthenia gravis, myocardial or pericardial Inflammation, 

osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, R titer's syndrome, rheumatoid 
arthritis, sderoderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erytheraatosus, 
syste m ic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome) 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, ba ct erial, fungal. 

35 parasitic, protozoal, and helminthic Infections, and trauma; a metabolic disorder such as diabetes. 
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bunya virus, calidvirus, ccrooavirus, Glcvirus, hcpadn avlnis, herpesvirus, flavi virus, orthomyxovirus, 
parvovirus, papova virus, paramyxovirus, plcorna virus, poxvirus, reovirus, retrovirus, i h abdbvirus, 
and tonga virus. 

5 Jn imothcr ernbodinxaX. a vector capable erf expressing GCREC or a fragment or derivative 

thereof may be administered to a subject to treat or prevent * disorder associated with decreased 
expression or activity of GCREC Including, but oot limited to, those described above. 

In a runner embodiment, a conposttian comprising a substantially purified GCREC in 
conjunction with a suitable pharmaceutical carrier may be administered lo a subject to treat or prevent a 
10 disorder associated with decreased expression or activity of GCREC including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of GCREC may be 
ad min istered to a subject to treat or prevent a disorder asso ci a te d with decreased expression or activity 
of GCREC including, but not limited to. those listed above. 

15 In a further embodiment, an antagonist of GCREC may be administered to a subject to treat or 

prevent a disorder associated with increased expression or activity of GCREC Examples nf such 
disorders include, but are not limited to, those cell proliferative, neurological, cardio vas cular, 
gastrointestinal, autoimmune/mOamnutory, and metabolic disorders, and viral infections, described 
above. In one aspect, an antibody which specifically binds GCREC may be used directly as an 

20 antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to 
ceDs or tissues which express GCREC. 

In on additional embodiment, a vector expressing the c o mp l em ent of the polynucleotide 
encoding GCREC may be administered to a subject to treat or prevent a disorder asso ci a te d with 
increased expression or activity of GCREC including, but not limited to, those described above. 

25 In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 

sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art. according to conventional pharmace uti cal principles. The combination 
of therapeutic agents may act synergisticafiy to effect the treatment or prevention of the various 

30 disorders described above. Using this ipproach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of GCREC may be produced using methods which are generally known in the 
an. In particular, purified GCREC may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind GCREC Antibodies to GCREC may 
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also be generated using methods thai am «rf known m the an. Such antibodies may indude, but are 
not limited to, polyclonal, monoclonal, crorncrjc, tnd tingle chain anuTwdks, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies fi e., those which inhibit 
df mer formation) are generally preferred for therapeutic use. 
5 For the production of antibodies, various hosts »«<:rwllng goats, rabbits, rats, mice, humans, 

and others may be immunized by injection with GCREC or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending cm the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited Ux Freuwfs, mineral gets 
such as aluminum hydroxide, and surface active substances such as hysohxsthia, pturonic polyots, 

10 poh/aniocs. peptides, oD emulsions. KLH. and dinltjophenoi. Among adjuvants used in humans, BCG 
(baemi Calmette-Guerin) and Corvnebaaerium parvum ore especially preferable. 

It Is preferred that the otigopeptides, peptides, or fragments used to induce antibodies to 
GCREC have an amino acid sequence consisting of at least about 5 amino acids, arxJ generally will 
consist of at least about 10 amino acids. It is also preferable that these otigoceptides, peptides, or 

15 fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches of 
GCREC amino adds may be fused with those of another protein, such as KLH. and antibodies to the 
chimeric molecule may be produced. 

Monoclonal antibodies uTGCREC may be prepared using any technique which provides for the 
production of antibody molecules by continuous ceU lines in culture. These include, but are not limited 

20 to, the hybridorna technique, the human B-cdl hybridoma technique, and the EBV-trytxidaina 
technique. (See, eg.. Kohler. O. et al (1975) Nature 256:495-497; Kozbor. D. el al. (1985) J. . 
lmjnunoL Methods 81 31-42; Cote, RJ. et al (1983) Proc Natl Acad. Set USA 8&2026-2O3O; and 
Cole, SJ». et al. (1984) Mo). Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 

25 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, eg., Morrison. S.L. et al. (1984) Proc 
Natl Acad. Sd. USA 81:685 1-6855; Neuberger. M.S. et al. (1984) Nature 312:604-608; KidTakeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the prcdoction of single 
chain antibodies may be adapted, using methods known in the art, to produce GCREC -specific single 

30 chain antibodies. Antibodies with related speciQciry, but of distinct idiotypic ccrrraposition, may be 
generated by chain shuffling from random combinatorial Immunoglobulin libraries. (See, eg.. Burton, 
D.R. (1991) Proc Natl. Acad. Sd. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production m the rymphocyte population 
or by screening inununoglobulin libraries or panels of highly specific binding reagents as disclosed in 
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the literature. (See, eg. Ortandi, R. et al. (1989) Proc Nail Acad. Set USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for GCREC may also be generated. 
For example, such fragments include, but are not limited to, F(ab>, fragments produced by pepsin 
5 digestion of the antibody m olec u le and Fab fragments generated by reducing the disulfide bridges of the 
F(ab")2 fragments. Alternatively. Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See eg., Huje, W.D. a al. 
(1989) Science 246:1275-1281.) 

Various Immunoassays may be used for screening to identify antibodies having the desired 

10 Specificity. Numerous protocols for competitive binding or immunoradiomctric assays using either 
poryctonal or monoclonal antibodies with established specificities are wen known in the art. Such 
imm u n oassays typically involve the measurement of complex formation between GCREC and its 
specific antibody. A two- site, monoclonal-based immunoassay utilbting monoclonal antibodies reactive 
to two ncm-interfcrmg GCREC epitopes is generally used, but a competitive binding assay may also be 

15 employed (Pound, moral . 

Various methods such as Scat chard analysis in conjunction with ladlcdmrnunoassay techniques 
may be used to assess the affinity of antibodies for GCREC. AfTmiiy is expressed as an association 
constant, which is defined as the molar concentration of GCREC -antibody complex divided by the 
molar cc»icenrxatiora of free antigen and free antibody under equilibrium ccaadtaons. The K, determined 

20 for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple 
GCREC epitopes, represents the average affinity, or avidity, of the antibodies for GCREC. The K, 
determined for a preparation of monoclonal antibodies, which are monospecific for a particular 
GCREC epitope represents a true measure of affinity . High-affinity antibody preparations with K. 
ranging from about 10* to \0° L/mole are preferred for use in hnnmnoassays in which the GCREC- 

25 antibody complex must withstand rigorous manipulations. Low-am nity antibody preparations with K, 
ranging from about 10* to 10 T L/male are preferred recuse fa imrrwnopurification and similar 
procedures which ultimately require dissociation of GCREC. preferably In active form, from the 
antibody (Catty, D. (1988) AmibotBes. Volume I: A Practical Approach. IPX Press, Washington DC; 
LkWen,J£. and A Cryer (1991) A Practical Ctfjtie to Monoclonal Anybodies. John Wiley A Sons, 

30 New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example a 
polyclonal antibody preparation containing at least 1-2 mg specific intibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed In procedures requiring precipitation of GCR EC-antibody 
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complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applica ti ons, are generally available (See eg.. Catty, supra , and 
Cougan et al. supra .) 

In another embodiment of the invention, the porynudeotides encoding GCREC, or any fragment 
5 or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing asnptanenttry sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding GCREC. 
Such technology is well known in the art. and antisense c^Eonuclootides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding GCREC 

1 0 (See, eg., Agrawal, S., ed. (1 996) Antisense Therapeutics . Humana Press Inc. Totswa N J .) 

In therapeutic use any gene delivery system suitable for introduction of the antisense 
s eq u e n ces Into appropriate target ce lls can be used. Antisense sequences can be delivered 
intraceDularty in the form of an expression p l as m id which, upon transcription, produces a sequence 
cornplernentary to at least a portion of the cellular sequence encoding the target protein. (See eg.. 

15 Slater. J JE. et al. (1 998) J. Allergy Cli. Immunol. 1 02(3):469-475; and Scanlon, KJ. et al. (1 995) . 
9(1 3): 1288- 1 296.) Antisense sequences can also be introduced irrtracelmlarly through the use of viral 
vectors, such as retrovirus and adeno- associated virus vectors. (See eg., Miller, A.D. (1990) Blood 
76:271 ; Ausnbel. supra: Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome -derived systems, artificial viral envelopes, and other 

20 systems known in the art. (See eg.. Rossi. J J. (1995) Br. Med. Bun. 51(0:217-225; Boado. RJ. et 
al. (1998) J. Pnarm. Sd. 87(1 1): 1308- 1315; and Morris, M.C. et al. (1997) Nuddc Acids Res. 
25(1 4): 2730-2736.) 

In another embodiment of the invention, polynucleotides encoding GCREC may be used for 
s o m a ti c or germline gene therapy. Gene therapy may be performed to G) correct a ge ne ti c deficiency 
' 25 (eg, in the cases of severe combined immuncrfefldency (SCIT>)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672). severe combined 
immunodeficiency syndrome associated with an inherited adenosine dea mi n ase (ADA) deficiency 
(Blaese R-M. el al (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner. J. et al (1993) CeU 75 JD7-2I6; Crystal. R.G. et al. (1995) Hum. Gene 
30 Therapy 6:643-666; Crystal, R.G. et al (1995) Hum. Gene Therapy 6:667-703). thalassemias, familial 
hyperchotenerafcmia, and hemophilia resulting from Factor VU1 or Factor IX deficiencies (Crystal. 
R.G. (1995) Science 270:404^410; Verm*. l.M. and N. Somia (1997) Nature 389:239-242)), (U) 
express a conditionally lethal gene product (eg., (n the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against ittracethitar parasites (eg. 
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Nature 335395-396; Pcesebla, tail (1996) Proc. Natl. Ac 
hepatitis B or C virus (HBV.HCV); fungal parasites, such as 



s (Hl^WSmore, D. (19*8) 
L Set USA. 93:1 1395-1 1 399X 



case where i genetic deficiency In CCREC expression or regulation 
GCREC from an appropriate pcpi 



s disease, the expression of 



GCREC arc treated by constructing mammalian expression vectors encoding CCREC and Introducing 
iSt-r>- vectors by mechanical means into GCREC -deficie» ceils. Mechanical transfer technologies for 
use with ceils tn vivo or vitro iorfutfc 01 cfirect DNA microinjection Into individual cells, (ii) ballistic 
gold panicle delivery. (Ill) liposome- mediated transection. (Iv) receptor-nasdiated gene transfer, and (v) 
the use of DNA transposons (Morgan, RA. and WE Anderson (1993) Anno. Rev. Bkxhem. 62:191- 
217; Ivies, Z. (1997) CeD 91 J01-5IO; Boulay, J-l_ and H. Reopon (1998) Curr. Opln. Biotechnoi. 
9*45-450). 

Expression vectors that may be effective for the expression of GCREC include, but are not 
limited to. the PCDNA 3.1 . EFTTAG. PRCCMV2. PREP. PVAX vectors anvirxogen, Carlsbad CA). 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2. PTRE2-LUC, PTK-HYG (Oontrch. Palo Alto CA). GCREC may be expressed 
using (I) * constitutrvdy active promoter. (e.g-. from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV), SV40 virus, thymidine kinase (TK), or B-acUn genes), (ii) an inducible promoter (eg., the 
tetracyeUne-regulated promoter (Gossen, M. and H. Bujard (1992} Proc Natl. Acad. Set USA 
89:5547-5551: Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Cott. Opto. Bkxechnoi. 9:451*456). co m mercially available In the T-REX plasmid (lnvitrogen)); the 
ecdysone-inducible promoter (available m the ptasnnds PVGRXR and PIND, lnvitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/na fe prts to ne inducible promoter (Rossi, F.M.V. 
and Blau, H.M. supra) ! or (1H) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding GCREC from a normal individual. 

Commercially available liposome transformation kits (eg., the PERFECT LIPID 
TRANSFECTfON KIT, available from lnvitrogen) allow one with ordinary tkjll in the art to deliver 
polynucleotides to target cells In culture and require minimal effort to optimize experimental 
the alternative, transformation is per fort 
t, FX- and AJ. Eb (1973) Virology 52:456467). or b 
(1982) EM BO J. 1 :841-845). The introduction of DNA to primary cells requires n 
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la another embodiment of the invention, dJ 
r e s p ect to GCREC expression are treated by constructing a retrovirus v 
poly n u cle otid e encoding GCREC under the control of an i nde^xjidetal pro m o te r or the retrovirus long 
terminal repeat (LTR) promoter. (H) appropriate RNA packaging signals, and Oil) a Rev-responsive 
element (RRE) along with additional retrovirus cir- acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (eg.. PFB and PFBNEO) arc 
commercially available (Stratagene) and are based on published data (Riviere. 1. et al. (1995) Proc 
Natl. Acad. Sd. USA 92.6733-6737), incorporated by reference herein. The vector Is propagated In an 
appropriate vector producing cell line (VPCL) that expresses an envelope gene who a troptan for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Anneniano, D. et al 
(1987) J. Virol. 61:1647-1650; Bender, MA et aL (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
AD. Miller (1988) J. Viral. 62:3802-3806; Dull, T. et al (1998) J. VfroL 72.-8463-8471; ZufTercy. R. 
et aL (1998) J. Virol. 72.-9873-9880). US. Patent Number 5.910.434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 
method for obt a in i n g retrovirus pack a gin g ceil ones and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of ceils (eg., CD4* T-cefls), and the 
return of transduced cells to a patient are procedures well known to persons skilled tn the art of gene 
therapy and have been weD documented (Ranga, U. et al, (1997) J. VfroL 71:7020-7029; Bauer, G. et 
aL (1997) Blood 89:2259-2267. Borryhadi, MX. (1997) I. Virol 71:4707-4716; Ranga, U. et al 
(1998) Proc Natl Acad Sd. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus -based gene therapy delivery system is used to deliver . 
polynucleotides encoding GCREC to ceils which have one or more genetic abnormal it Irs with respect to 
the expression of GCREC. The construction and packaging of odenovlrus-based vectors are well 
known to those with or dinar y skin in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immuncregutatory proteins Into intact Islets to the pancreas 
(Csete. ME. et al. (1 995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Anneniano C Adenovirus vectors Tor gene therapy"), 
hereby Incorporated by reference For adenoviral vectors, see also Antinoizi, P. A, et aL (1999) Annu. 
Rev. Nufr. 19 3\ 1-544 and Verma. LM. and N. Somia (1997) Nature 18389:239-242. both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding GCREC to target cells which have one or more genetic abnormalities with 
respect to the expression of GCREC. The use of herpes simplex virus (HS V)-bascd vectors may be 



WO 01/57085 PCT/US01/U3455 

especially valuable for introducing GCREC to cells of the centra] nervous system, for which HSV has a 
tropism. The construction and packaging of herpes-based vectors are weD known to those wim 
ordinary skill in the art. A repUcation-coira^etent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu. X. et al. (1999) Exp. Eye Res. 
5 169-385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex vims strains for gene transfer-), which is 
hereby incorporated by reference. U.S. Patent Number 5.804.413 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cril under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by this 

10 patent are the construction and use of recombinant HSV strains deleted for 1CP4. 1CP27 and ICP2Z 
For HSV vectors, see afro Corns, W.F. et al (1999) f. Virol. 73319-532 and Xu, H. et al. (1994) Dev. 
Biol. 163:152-161, hereby Incorporated by reference. The manipulation of cloned herpesvirus 
sequences, the generation of reco m b in a nt virus following the transfection of multiple ptassnJds 
containing different s e g m e n ts of the large herpesvirus g enomes , the growth and prop a g a t ion of 

13 herpesvirus, and the infection of cells with herpesvirus are techniques wdl known to those of ordinary 
skill in the art. 

In another alternative, an alpha virus (positive, single-stranded RNA virus) vector is used to 
deliver pc*ym>deoti0es encoding GCREC to target cells. The Otology of the prototyplc alpha virus, 
Semllki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on 

20 the SFV genome (Garoff.H. and K.-J.U (1998) Curr. Opic Btolechnol. 9:464-469). During 

alpha virus RNA replication, a robgenotroe RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levch than the full length genomic RNA. resulting 
in the overproduction d capsid proteins relative to the viral proteins with enzymatic activity (eg,, 
protease and polymerase). Similarly, Inserting the coding sequence for GCREC into the alpha virus 

25 genome In place at the capsid-codlng region results la the production of a large number of GCREC- 
cocfing RNAs and the ryishesis of high levels of GCREC In vector transduced cells. While alpha virus 
Infection Is typically associated with cell lysis within a few days, the ability to establish a persistent 
infection In hamster normal kidney cdfr (BHK-21) with a variant of Smdbis vfrus (SIN) indicates that 
the lytic rep l ica tio n of alpha viruses can be altered to soil the needs of the gene therapy application . 

30 (Dryga, S.A et al. (1997) Virology 228:74-83). The wide host range of alpha viruses wm allow the 
introduction of GCREC mto a variety of cell types. The tpedflc transduction of a subset of ceils In a 
popul at ion may require the sorting of cells prior to transduction. The methods of m anipu l ating 
infectious cDNA clones of alpha viruses, performing alpha virus cDN A and RNA transections, and 
performing alpha virus Infections, are well known to those with ordinary skill In the art. 
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Oligonucleotides derived from the transcription initiation site eg., between about positions -10 
and + 10 from the start site, may also be employed to Inhibit gene expression. Similarly; inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
Inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
5 transcription factors, or regulatory molecules. Recent thera p e u tic advances using triplex DNA have 
been described in the literature. (See, eg.. Gee, J.E et al. (1994) in Hobcr, BE. and B.l. Can-, 
Molecular and Immunologic Antwoachcs. Futura Pubusfrwg, ML KlscoNY. pp. 163-177.) A 
complementary sequence or antisertse molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 
10 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribuzyme action Involves sequence-specific hybridization of Che rfbazyme 
molecule to complementary target RNA, followed by endonuclecJytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and e ffic ie n tly catalyze 
endonudeorytic cleavage of sequences e nc odi n g GCREC 
1 5 Specific rihozyme cleavage sites within any potential RNA target are initially Identified by 

scanning the target molecule for ribozyme cleavage sites, inducting the following sequencer. QUA, 
GUU.aadGUC Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucieoticle Inoperable. The suitability of 
20 cancSdate targets may also be evaluated by testing accessibility to hybridization with ccanplementary 
oligonucleotides using ribo n ud e asc protection assays. 

Complementary ribonucleic add motccoles and rfbozymea of the invention may be prepared by 
any method known tn the art for the synthesis of rmctric add molecules. These include ttcfr/aqnes for 
chemically synthesizing ollgotwdeotides such as solid phase phosphoramkllte chemical synthesis. 
25 Altematlveiy, RNA molecules may be generated by rn vitro and In vivo transcription of DNA sequences 
encoding OCREC. Such DNA sequences may be incorporated Into a wide variety of vectors with 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA. constitutivety or indudbry, can be Introduced Into cdJ lines, cells, or 
tissues. 

30 RNA molecules may be modified to Increase Intracellular stability and half-lift Possible 

; inodUlcationi Include, hut are dpi Hinted to, the a ddi tion of flanking seque nce s at the 5 'and/or 3* cods 

j of the t n okc ul e. or the use of pho&pbrrothioate or 2' O- methyl rather than ylMJ^JiOda m n ase l ink a ges 

within the backbone of the molecule. Ttris concept is Inherent In the production of PN As and can be - 
extended In all of these molecules by the I n c lusi on of nootracfitional bases such as taostae, queen I ne . and 
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wybtucdne. as well as acetyl, methyl-, ihir>, and similarly 
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An additional embrjdimcnt of the Invention encompasses a method for screening for a 
compound whkb is effective in altering expression of a polynucleotide encoding CCREC. 
Compounds which may be effective In altering expression of a specific poly n ucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
s which are capable of Interacting with specific p 
s may alter polynucleotide expression by a 
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
CCREC expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding CCREC may be therapeutically useful, and in the tr 
assoclated with decreased CCREC expression or activity, a compound 
expression of the polynucleotide encoding GCREC may be therapeutically useful. 

At least one, and op to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art. including chemical modification of a compound known to be effective In 
altering polynucleotide expression; selection from an existing, commercially -available or proprietary 
il compounds; rational d es ig n of a compound 
i) properties of the target polynucleotide*, and selection from a 



polynucleotide encoding CCREC is exposed to at least one test compound thus obtained. The sample 
may co m p ris e, for example, an intact or pcr ro cabt l ized cell, or an In vitro cell-tree or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide en c odi ng CCREC are assayed 
by any method commonly known in the art- Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide seque n ce complementary to the sequence 
of the poTynucleotide encoding CCREC The amount of hybridization may be Quantified, thus 
forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more lest compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound Is effective in altering the expression of 
Che polynucleotide. A screen for a compound effective in altering exp re ss ion of a specific 
polynucleotide can be carried out, for example, using a Scrfl-pjsaccharprnvces pombe gene expression 
system (Atkins, D. el al. (1999} U.S. Patent No. 5.932,433; Aradt, CM. « al. (2000) Nucleic Adds 
Rex. 28:E15) or a human cell line such as HeLa cell (Clarke, Ml- et al. (2000) Biocfaem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of the present Invention invol 
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nucleic adds, and modified oUgonucieotides) for antisense activity against a specific polynucleotide 
sequence (Brcice, T.W. et al. (1997) VS. Patent No. 5.686.242; Bruice, T.W. et al. (2000) U.S. 
Patent No 6.022.691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable for 
use in vivo, in vitro , and ex viva For ex vivo therapy, vectors may be introduced into stem cefis taken 
from the patient and ckmaQy propagated lor autologous oransplam back into that same patient. 
Delivery by transection, by liposome injections, or by pclycationic amino polymers may be achieved 
using methods which are well known in the art. (See, eg., Goldman. C.K. et al. (1997) Nat. 
BiotechnoJ. 15:462-466.) 

Any of the therapeutic mcthotE; described above may be applied to any subject in need of such 
therapy, inchtdirtg, for e 




formulations are commonly known and are thoroughly discussed in the latest e 
Pharmaceutical Sciences (Maack Publishing. Easton PAY Such compositions may consist of GCREC, 
antibodies to GCREC, and tniinetics, agonists, antagonists, or inhibitors of GCREC. 

The compositions utilized in this Invention may be administered by arry number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra -arterial, intramedullary, intrathecal, 
intraventricular, purmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

eraDy aerosolized immediatdy prior to inhalation by the patient. In the case 
s (eg. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 
formulations is wdl-known In the art la the case of tnacromoleeules (eg larger peptides and proteins), 
recent developments in the Odd of pulmonary delivery via the alveolar region of the lung have enabled 
the practical delivery of drugs such as insulin to blood circulation (see, eg.. Patton, J.S. et al., U.S. 
Patent No 5.997,848). Pulmonary delivery has the advantage of adrmrastration without needle 
injection, and obviates the need ror potentially toxic penetration enhancers. 
Compositions suitable for use m the invention inctude cornpo sl ti 
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Specialized forms of compositions may be prepared for direct Intra cell u l ar delivery of 
rnacromolecules comprising GCREC or fragments thereof. For example, liposome preparations 
containing a cefl -impermeable macromolecule may promote ceD fusion and intracellular delivery of the 
macrontolecule. Alternatively, GCREC or a fragment thereof may be Joined to a short anionic N- 
5 terminal portion Iron) the HIV Tat-1 protein. Fusion protdns thus generated have been found to 
transduce into the cells of an tissues, including the brain. In a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially dther In cell 
culture assays, eg., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
10 or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of administration- Such Information can then be used to determine useful doses and routes for 

i A therape uti c all y effective dose refers to that a m ou n t of active ingredient, for example GCREC 
or fragments thereof, antibodies of GCREC and agonists, antagonists or inhibitors of GCREC. which 

15 ameliorates the symptoms cr condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures fn cdl cultures or wiih experimental animals, such as by 
calculating the ED* (the dose rhcrapcutically effective in 50% of the population) or LD„ (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LDji/EDm ratio Compositions which exhibit large 

20 , therapeutic indtees are preferred. The data obtained from cdl coltnre assays and animal studies are 
used to formulate a range of dosage for human ose. The dosage contained in such cornpositions is 
preferably within a range of drculating concentrations that Indudes the ED* with Utile or no toxidty. 
The dosa ge varies within this range depending upon the dosage form employed, the sensitivity of die 
pati e n t, and the route of ad m i n i s tration. 

25 The exact dosage win be determined by the practitioner, In light of factors related to the subject 

requiring treatment. Dosage and administration are adjusted to provide suffident levels of the active 
moiety or to maintain the desired effect. Factors which may be taken Into account include the severity 
of the disease state, the general health of rhe subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug COmbination(sX reaction sensitivities, and response to therapy. 

30 Long-acting crjrnposltions may be administered every 3 lo 4 days, every week, cr biweekly depending 
on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 ug to 100,000 "g. up to a total dose of 
about 1 gram, depencfing upon the route of adzirinisttafion. Guidance as to particular dosages and 
methods of delivery is provided In the literature and generally available to praai lionets in the art. 
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Those skilled in the an will employ different formulations for nucleotides than for protdns or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 

DIAGNOSTICS 

5 In another embodiment, antibodies which specifically bind GCREC may be used for the 

diagnosis of disorders characterized by expression of GCREC, or in assays to monitor patients bdng 
treated with GCREC or agonists, antagonists, or inhibitors of GCREC. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. . 
Diagnostic assays for GCREC include methods which utilize the antibody and a label to detect GCREC 
10 In human body fluids or in extracts or cells or tissues. The amibocfles may be used with or without 
modification, and may be labeled by covalent or non-covalera attachment of a reporter molecule. A 
wide variety of reporter molecu les, several of which are described above, are known in the art and may 

A variety of protocols for measuring GCREC. including ELlSAs. RJ As, and FACS. are known 

15 Id the art and provide a basis for diagnosing altered or abnormal levels of GCREC expression. Normal 
or sta ndard values for GCREC expression are established by co mb i n i n g body fhtifh or cdl extracts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to GCREC 
under conditions suitable for complex formation. The amount of standard complex formation may be 
qu an ti tat ed by various methods, such as photometric means. Quantities of GCREC expressed in 

20 subject, control, and disease samples from biopsied tissues arc compared with the standard values. 
Deviation b e t ween standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding GCREC may be used for 
diagnostic purposes. The polynucleotides which may be used inctude oligonuc leotid e sequences, 
complementary RN A and DNA molecules, and PN As. The polynucleotides may be oscd to detect and 

25 quantify gene expression in biopsied tissues in which expression al GCREC may be corrdated with 
disease. The diagnostic assay may be used to determine absence, presence, and e xc ess expression of 
GCREC and to monitor regulation of GCREC levels during therapeutic intervtrtlon. 

In one aspect, hybri di zat ion with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding GCREC or closely related molecu l es may be used to 

30 identify nucldc add sequences which encode GCREC. The spedfidry of the probe, whether It is made 
from a highly specific region, e.g.. the 5* regulatory region, or from a less specific region, eg., a 
conserved motif, and the stringency of the hybridization or amplification win determine whether the 
probe identifies any naturally occurring sequences encoding GCREC, allelic variants, or related 
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Probes may also be used for the detection of related sequences. tnnOy have at leas SOSb 
seqoeace Ideairy to any of the GCREC encoding scqueacs. The rrybridttation probes of the subject 
invention nay be DN A or RN A and may be derived from the sequence of SEQ ID NO. 22 -4 2 or tram 
genomic sequences including promcfcrs, enhancers, and rotrons of the GCREC gene. 
5 Means ft* producing specific hybridization probes for DNAs encoding GCREC include the 

cloning of polynucleotide sequences encoding GCREC or GCREC derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the an, are co mm erc i ally available, and may 
be used to synthesize RN A probes in vitro by means of the addition of the appropriate UNA 
polymerases and the appropriate libeled nucleotides. Hybridization probes nay be labeled by a variety 
10 of reporter groups, for example, by radionuclides such as "P or "S, or by enzymatic labels, such as 
alkaline phosphatase coupled to the probe via avidin/btotin coupling systems, and the like. 

Porynucleotide sequences encoding GCREC may be used for the diagnosis of disorders 
associated with expression of GCREC. Examples of such disorders include, but are not limited to, a 



15 



vers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
20 penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a neurological 

disorder such as epilepsy, Ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease. Pick's disease, Huntington's disease, de m e nti a, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 
muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
25 diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abs c es s , 





developmental disorders of the central 



nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, derma tomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
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mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia. amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia. Tourette's disorder, progressive supranuclear palsy, oorucobasal degeneration, and familial 



ftshil 



5 hypertension, vasculitis, Raynaud's dis ea s e , aneurysms, arterial dissecti ons, varicose veins. 

thrombophlebitis and phJebothrombosis, vascular rumors, complications of thrombolysis, balloon 
angioplasty, vascular re plac e m ent, and coronary artery bypass graft surgery, congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 
degenerative valvular heart dis e as e , calcific aortic valve stenosis, congenially bicuspid aortic valve, 

10 mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease. 
Infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid bean disrate , cardiomyopathy, myocarditis, pericarditis, n 



1 as dyspha, 
15 carci n o m a, dyspep 

gastroparests, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intestinal tract, peptic ulcer, cnotelhhlasis. cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, Hilary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the river, hepatoma, infectious coll lis. ulcerative colitis, ulcerative proctitis, 
20 Crohn's disease. Whipple's disease. Mai lory- Weiss syndrome, colonic carcinoma, colonic 

obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, jaundice, hepatic 



antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, portal vein 



occlusive di s ea s e, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas; an 
autoimmu ne/1 nfl ammat ory disorder such as acquired immunodeficiency syndrome (AIDS). Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 

30 asthma, aiherosclerosls, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 

polyendocrincpauty-candlcuasU^ dystrophy (APECED), bronchitis, cholecystitis, contact 

dermatitis. Crohn's disease, atopic dermatitis, derrnatomyosltis, diabetes meHitus, emphysema, 
episodic lymphopenia with lympnocytoroxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves* disease, Hashimoto's 

35 thyroiditis, bypereorinopbilia, irritable bowel syndrome, multiple si 



psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 



colitis, uveitis. Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
5 circulation, viral, bacterial, fungal, parasitic protozoal, and helminthic Infections, and trauma; a 
metabolic disorder such as diabetes, obesity, and osteoporosis; and an infection by a viral agent 
classified as adenovirus, arenavirus, bunyavirus, caliclvirus, coronavirus, fflovirus, hepadna virus, 
herpesvirus, flavivirus, orthomyxovirus, parvovirus, papova virus, paramyxovirus, picoma virus, 
poxvirus, reovbus, retrovirus, rhabdovirus, and toogavirus. The rxtrynucleotide sequences encoding 
10 GCREC may be used In Southern or northern analysis, dot blot, or cither membrane-based trchxclogies; 
in PCR technologies; in dipstick, pin, and nwltlfbrmat EXISA-Uke assays; and in nucroarrays utilizing 



methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding GCREC may be useful in assays that 



sequences encoding GCREC may be labeled by standard methods and added to a fhrid or tissue sample 
from a patient under conffitiora suitable for the formation at hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal Is quantified and co mp ar ed with a standard 
value If the amount of signal in the patient sample is significantly altered in comparison to a control 
20 sample then the presence of altered levels of mxleotide sequences encoding GCREC in the sample 

inflcates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy 
of a particular therapeutic treatment regimen to animal studies, in clinical trials; c* ro monitor the 



In order to provide a basis for the diag n o sis of a disorder associated with expression of 
25 GCREC, a normal or standard profile for expression b established. This may be accctnptished by 
gbodyrfajtdsorceDe 



amplification. Standard hybridization may be quantified by comparing the values obtained from normal 



30 polynucleotide Is used Standard values obtained In this a 

from samples from patients who are symrjrotnafic for a disorder. Deviation from standard values Is 
used to establish the presence of a disorder. 

Once the presence of a disorder b established and a ireatment protocol is initiated, 

i regular basis to determine if the level of expression in the 



patient begins to approximate chat which is observed in 
successive assays may be used to show the efficacy of t 



The results obtained from 
over a period ranging from several 



With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
5 overexpressed) in biop si e d tissue from an individual may indicate a pr e disposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

10 Additional diagnostic uses for oligonucleotides designed from the sequences encoding GCREC 

may involve the use of PCR. These oligomers may be chemically synthesized, generated eraymaiicaHy, 
or produced in vitro . Oligomers will preferably contain a fragment of a porynucleotide encoding 
GCREC, or a fragment of a polynucleotide contrMementary to the polynucleotide encoding GCREC, and 
will be employed tinder optimized conditions for ide ntifi cation of a specific gene or condition. 

15 Oligomers may also be employed under less s t r ingen t conditions for detection or q uan tifi cati on of 
closely related DNA or RMA sequences. 

In a particular aspect, ntigonucleotide primers derived from the polynucleotide sequences 
encoding GCREC may be used to detect single nucleotide polymorphisms (SNPs). SMPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 

20 In humans. Methods of SNP detection include, but are not linriied to, single-stranded ccxifbrmatian 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) inerhods. In SSCP. cfigo n u cVori de primers 
derived from the pcJyraicieotide sequences encoding GCREC are used to amplify DNA using the 
poiyrnernse chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary 

25 and tertiary structures of PCR products in single-stranded form, and these differences are detectable 
using gel electrophoresis to non-denaturing gels. In fSCCP. the oUgonucleotide primers are 



DNA st 



(IsSNP), are capable of identifying pc*ymorphisrm by comparing the sequence of Individual 
overlapping DNA fragments which assemble Into a common consen su s sequence , These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence darorratograms. In the 
alternative, SNPs may be d e t ec t ed and characterized by mass spectrometry using, for example, the high 



throughput MASS ARRAY system (Sequenom, Inc.. San Diego C A). 
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Methods *4&eb may also be used to q u a n tify die expression o 
biatjrrylaunj nucleotides, co arn plificalion of a control nucleic Kid, and Interpolating results from 
standard curves. (See, eg.. Mdby. P.C et al. (1993) J. Immunol. Metfnds 159:235-244; Duplaa, C. et 
si. (1993) Anal. Bkxtan. 212:229-236.) Use speed of quantitation of multiple samples may be 
5 accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectropbotornetric or coJorirnetric response gives rapid 
quantitation. 

In farther emrxxBmcnts, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as dements on a inks o&i ray. The trucroarray 

10 can be used in transcript imaging ttxrtrnqiies which monitor the relative expression levels of large 
numbers of genes simultaneously as described below. The microamy may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 
function, to understand the genetic basis of a disci do, to diagnose a disorder, to mo n i t or 
prrrgressionrTegjes-Jon of disease as a ftmction of gene expression, and to develop and monitor the 

15 activities of therapeutic agents Id the treatment of disease. In particular, this information may be used 
to develop a pharntacogenomic profile of a patient in order to s el e c t the most appropriate and effective 
treatment regimen for that patient. For example, therapeutic agents which are highly effective and 
display the fewest side effects may be selected for a patient based on his/her prwrnacogertomic profile. 
In another embodiment, GCREC fragments of GCREC. or antibodies specific for GCREC 

20 may be used as elements on a rnkroarray. The mkroamy may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above, 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of i (issue or cell type. A transcript image repre se nt s dm global pattern of 
gene expression by a particular tissue or cdl type. Global gene expression patterns arc analyzed by 

25 quantifying the number of expressed genes and their relative abundance under given conditions and at a 
given time. (See SeGhamer el al, "Comparative Gene Transcript Analysis,** U.S. Patent Number 
5,840,484, expressly incorporated by refere nc e herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 

30 hybridization takes place tn high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a nncroarray. The 
resultant transcript image v**ild provide a profile of gene activity. 

Transcript images may be gene r ate d using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vim as in the 
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case of a tissue or biopsy sample, or In vitro, as in the case of a cell tine. 

Transcript images which profile the expression of the pdyrarcleotides of the present invention 
may also be used hi conjunction with |n yi-rp nwW systems and preclinical evaluation of 
ptumac oBttais, as well as tcndcological testing of industrial and naturaDy-occurring environmental 



molecular firigerprinu or toxicant signatures, which arc indicative of mecbaiusna of action and toxicity 
(Nuwaysir. EJ\ et al. (1999) MoJ. Carcinog. 24:153-159; Stdnex. S. and NX, Anderson (2000) 
Toxicol. Leo. 112-1 1 3:467-47 1 . expressly incorporated by reference herein). If a test compound has a 
signature rimflar to that of a compound with known toxicity, it is likely to share those toxic properties. 
10 These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome- wide measurement of expression 
provides the highest quality signature. Even genes whose expression is not altered by any tested 
compounds are Important as well, as the leveb of expressiopcf these genes 

rest of the expression data. The normalization procedure is useful for co m parison of expression data 
15 after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
example. Press Release 00-O2 froth the National Institute of Envrrorrrnental Health Sciences, released 
February 29, 2000, available at r*tp7/www.r*ehsj»ihgOY/cc^ Therefore, ft is 

20 important and desirable in toxicoiogica] screening using toxicant signatures to include all expressed 



In one embodiment, the toxldly of a test compound is assessed by treating a biological sample 
containing nucldc adds with the test compound. Nucleic adds that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
25 present invention, so that transcript levels correspond! ng to the polynucl eotides of the present I 
invention may be quantified. The transcript levels in the treated biological sample are com p are d with 
levels in an untreated biological sample. Differences in the transcript levels between the rwo samples 



30 invention to analyze the proteome of a tissue or cell type. The term proteorne refers to the global 

pattern of prctrin expression in a particular tissue or cdl type. Each protdn component of a proteorne 
can be subjected individually to further analysis. Proteorne expression paaerns, or profiles, are 



conditions and at a given time. A profile of a cell's proteorne may thus be generated by separating and 
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analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation Is 
achieved using rwo-dirriensional gd decsroprnresis. in which proteins from a sample are separated by 
isoelectric focusing in the first dimension, and then according to molecular wdght by sodium dodecyl 
sulfite slab gd electrophoresis in the second dimension (Stdner and Anderson, moral . The proteins are 
5 visualized in the get as discrete and uniqudy positioned spots, typically by staining the gd with an agent 
such as Coomassie Blue or silver or fruorescers stains. The optical density of each protdn spot is 
generally proportional to the levd of the protdn in the sample. The optical densities of equivalemJy 
positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes In protdn 

10 spot density related to the treatment. The proteins In the roots are partially sequenced using, for 

example. Standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein In a spot may be determined by comparing Its partial sequence, preferably of 
at least 5 contiguous amino add residues, to the polypeptide sequences of the present lirvention. In 
some cases, further sequence data may be obtained for definitive protdn Identification. 

15 A proteomic profile may also be generated using antibodies specific for GCREC to quantify the 

levels of GCREC expression, in one embodiment, the antibodies are used as elements on a rnkroarray, 
and protdn expression levels are quantified by exposing the rraaoairay to the sample and detecting the 
levers of protdn bound to each array dement (Lucking. A o al. (1999) Anal. Biochem. 270:103- 111; 
Mendoze. L.G. et al. (1999) Biotednsques 27:778-788). Detection may be performed by a variety of 

20 methods known in the art. for example, by reacting the proteins in the sample with a thiol- or ami no- 
reactive fluorescent compound and detecting the amount uf JTuoretcence bound at each array dement. 

Toxicant signatures at the proteorne levd are also useful for lexicological screening, and should 
be analyzed in paralld with toxicant signatures al the o-anscript level. There is a poor corrdatian 
between transcript and protdn abundances for some proteins in some tissues (Anderson. N.L. and J. 

25 Sdmamcr (1997) Becuopboresis 18:333-537), so proteorne toxicant rigmuures may be useful in che 



proteomic profile In addition, the analysis Of transcripu in body fluids is difficult, due to rapid 
degradation of mRNA. so prrxerjnrtc profiling may be more reliable and Informative In such cases. 
In another embodiment, the toxicity of a test compound is assessed by treating a biological 



sample are separated so that the amount of each protdn can he quantified. The a 
is compared to the amount of the corresponding protdn in an untreated biological sample, A difference 
in the amount of protdn between the two samples is indicative of a toxic response to the test compound 
in the treated tample Individual protdns are identified by sequendng the amino acid residues of the 
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individual protdns and comparing these partial sequences to the p 

In another e m bo d iment, the toxicity of a test compound is a 
sample containing protdns with the test compound. Protdns from the biological sample are incubated 
with antibodies spedflc to the polypeptides of the present invention. The amount of protdn recognized 
by the antibodies is quantified The amount of prolan in the treated WotogicaJ sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

Mlcroarrays may be prepared, used, and analyzed using methods known in the art. (See. eg, 
Brennan. T.M. et al. (1995) U.S. Patent No. 5,474.796; Schena, M. d al. (1996) Prcc. Nati. Acad. Sd. 
USA 93: 10614-1 06 1 9; Baldeschwdler et al. (1995) PCT application W09372511 16; Shaton, D. etal 
(1995) PCT application WO95/35505; Hdler. R.A et al. (1997) Prcc Natl. Acad. Set USA 94:2150- 
2153; and Hdler. MJ. et al. (1997) U.S. Patent No. 3,605.662.) Various types of rakyoarrayi are well 
known and thoroughly described in DNA Microarravs: A Practical Approach. M. Schena. cd f 19991 
Oxford University Press, London, hereby expressly incorporated by reference. 

In a 



coding or noocoding sequences may be used, and in sr. 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause unties! red cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular ch 'O iiK> M | *:K\ tn a rpevlfir region of a 
chromosome, or to artificial chromosome ecrnstructions, eg., human artificial chromosomes (HACs), 
yeast artificial cbronxsornes (YACs). bacterial artificial cfcrccrnsotnes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (Sec, eg., Harrington, JJ. a al. (1997) Nat. 
Genet. 15.345-335; Price, CM. (1993) Blood Rev. 7:127-134; and Tr ask, BJ. (1991) Trends Genet 
7:149* 1 34.) Once mapped, the nucldc add sequences of the Invention may be used to develop genetic 




data. (See, eg.. Hdra-Ufrich. et al. (1995) in Meyers, Kupra. pp. 963-968.) Examples of genetic map 
data can be found in various scientific journals or at the Online Menddian Inheritance in Man (OMIM) 
World Wide Web site Corrdatioo between the location of the gene encoding GCREC on a physical 
map and a specific disorder, or a predisposition to a specific disorder, may brip define the region of 
DNA associated with that disorder and thus may further positional cloning efforts. 
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]n_syj hybridization of ctroroosomal preparations and physical n!^p^ techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the pUceraem of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even tfthe exact chromosomal tocos is not known. This inrormarJoa is 
5 valuable to investigators searching for disease genes using positional dotting or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome nave been crudely localized 
by genetic linkage to a particular genomic region, eg., stasia-telangiectasia to 1 lq22-23, any sequences 
mapping to that are* may represent associated or regulatory genes for farther investigation. (See, eg., 
Garti. R.A. et al (1988) Nature 336" 377-580.) The nucleotide sequence of the Instant invention may 

10 also be osed to detect differences in the chromosomal location doe to translocation. Inversion, etc. 
among normal, carrier, or affected individuals. 

In another exnbocBmo* of the invention, GCREC. its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free ta sofatifx\ affixed to • 

IS solid support, borne on a ceQ surface, or located iittracellularty. The formation of binding cornplexes 
between GCREC and the agent being tested may be measured. 

Another technique for drug screening provides for Ugh throughput screening of compounds 
having suitable binding affinity to the protein of Interest. (See, eg., Geysen, el al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 

20 synthesized an a solid substrate. The test compounds are reacted with GCREC, or fragments thereof, 
and washed. Bound GCREC is then detected by methods wen known in the in. Purified GCREC con 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support 

25 In another crrtxxfiment. one may use competitive drug screening assays In which neutralizing 

antibodies capable of binding GCREC specifically compete with a test compound for binding GCREC. 

In this manner, antibodies can be used to d et e ct the presence of any peptide which shares one or more 

antigenic determinants with GCREC. 

In additional embodiments, the nucleotide sequences which encode GCREC may be used in any 
30 molecular biology techniques that have yet to be de>r*opcd, provided the new techniques reJy on 

properties of nucleotide sequences that arc currently known, inchxfing. but dm limited to, such 

properties as the triplet g e netic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can. using the preceding 

description, utilize the present invention to Its fullest extent The following embodiments are, 
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therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

The disclosures of all patents, applications and p ublicat ions, mentioned above and below. In 
particular U.S. See. No 60/180.093 and U.S. Ser No. 60/182,045. tie expressly haaorptrated by 
5 reference herein. 

EXAMPLES 

1. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the UFESEQ GOLD d at a b as e 

10 (Incytc Genomics, Palo Attn CA) and shown in Table 4, cohrmn 5. Some tissues were bomogenized 
and lysed in guarndlnJum Iscxhiocyanate, while others were brjmog enbx d and tyred in phenol or in a 
suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol 
and guankfine isothiocyanate The resulting lysates were cemrifuged over CsO cushions or extracted 
with chloroform. RN A was precipitated from die lysates whh either isopropanoJ or sodium acetate and 

15 ethanol, or by other routine methods. 

Phenol extraction and prectpiutioa of RNA were repeated as necessary to Increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(AH RNA was isolated 
using o»go dfO-cuupled paramagnetic particles (Promega), C4JGOTEX latex particles (QIAGEN, 
Cbatsworth CA), or an OUGOTEX tnRNA puriBcauon kit (QIAGEN). Alternatively. RNA was 

20 isolated directly from tissue lysates using other RNA isolation kits, eg., the POLY(A)PURE mRNA 
purification kit (Amnion, Austin TX). 

In some cases, Stratagcne was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed whh the UNTZAP 
vector system (Stratagcne) or SUPERSCRIPT plasmid system (Life Technologies), using the 

25 recommended procedures or similar methods known in the art. (See, eg.. Ausubd, 1997, supra, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were tigated to double stranded cDNA. and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDN A was size- selected (300-1000 
bp) using SEPHACRYL SI 000, SEPH AROSE OL2B. or SEPH AROSE CL4B column 

30 cirrornatography (Amersham Pharmacia Biotech) or preparative agarose gd electrophoresis. cDNAs 

PBLUESCRIPT piasmid (Siratagene), PSPORTI plasmid (Ufe Technologies). PCDNA2.1 plasrmd 
(Invitrogcn, Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (Incytc Genomics, Palo Alto 
CA), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli c el ls 
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iiichtdlng XL 1 -Blue, XLl-BlueMRF. or SOLR from Straugeneor DH5o, DH10B. or BectroMAX 
DH10B from Life Technologies. 

II. Isolation of cDNA Clones 

Plasmids obtained as described In Example I were recovered from host cells by h>vivo excision 
5 using the UN1ZAP vector system (Stralagene) or by ceD lysh. Plasmids were purified using at least 
one of the following: a Magic or WIZARD Mini preps DNA purification system (Promega); an AGTC 
Miniprep purification kit (Edge Bicoystems. Galthersburg MD); and Ql AWELL 8 Plasmid, Q1AWELL 
8 Phis Plasmid. QiAWELL 8 Ultra Plasmid purification systems or the R EAL PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of 
10 distilled water and stored, with or without lyophllization, at 4*C. 

Alternatively, plasmid DNA was amplified from host cdl lysates using direct link PCR In a 
ttgh-thruugbpul format (Rao, V.B. (1994) Anal Bfcxhem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
weD plates, and the concentration of amplified plasmid DNA was quantified fluorometricaily using 
1 5 P1COGREEN dye (Molecular Probes. Eugene OR) and a FLUOR OS KAN II fluorescence scanner 
(Labsyttans Oy, Helsinki. Finland). 

III. Sequencing and Analysts 

Incytc cDNA recovered In plasmids as described in Example IT were sequenced as follows. 
Sequencing reactions were pn yrssfd using standard methods or high-throughput Instrumentation 

20 such as the AB1 CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
(MJ Research) In conjunction with the HYDRA mlcrodlspenser (Robbins Scientific) or the 
MICROLAB 2200 (HtmiHon) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kitt such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

25 Qectrophoretic separation of cDNA sequencing reactions and detection of labeled pcrfyrajcJeotides were 
carried out using the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics); the ABI 
PRISM 373 of 377 sequencing system (Applied Biosystems) fat conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known In the art. Reading 
frames within the cDNA sequences were identified using standard methods (reviewed In Ausubd, 1997, 

30 acta, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 
disclosed in Example VTJ1. 

The polynucleotide sequences derived from Incytc cDNAs were validated by removing vector, 
tinker, and poly(A) sequences and by masking ambiguous bases, using algorithms and programs based 
on BLAST, dynamic programming, and dl n ucleotide nearest neighbor analysis. The incytc cDNA 
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sequences or translations thereof were then queried against a selection of public databases such as the 
GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS. PRINTS. 
DOMO. PRODOM. and hidden Markov model (HMM)-based protein family databases such as PFAM. 
(HMM is a probabilistic approach which analyzes consensus primary structures of gene families. 
5 See, for example, Eddy. S.R. (1996) Curr. Opdn. Struct. Biol. 6:361-365.) The queries were 

performed using programs based on BLAST, FAST A BLIMPS, and HMMER. The Incytc cDN A 
sequences were assembled to produce lull length polynucleotide sequences. Alternatively. GenBank 
cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or Genscan-predictcd coding 
sequences (see Examples IV and V) were used to extend Incyte cDNA assemblages to full length. 

10 Assembly was performed using programs based cm Phred, Phrap, and Corned, and cDNA assemblages 
were screened for open reading frames using programs based on GeneMark, BLAST, and FAST A. 
The fun length polynucleotide sequences were translated to derive the corresponding full length 
polypeptide sequences. Alternatively, a polypeptide of the indention may begin at any of the methionine 
residues of the full length translated polypeptide Full length polypeptide sequences were subsequently 

15 analyzed by querying against databases such as the GenBank protein databases (genpept), SwissProt, 
BLOCKS, PRINTS, DOMO, PRODOM. Proslte, and hidden Markov model (HMM>based protein 
family databases such as PFAM. Full length polynucleotide sequences are also analyzed using 
MACDNAS1S PRO software (Hitachi Software Engineering, South San Francisco CA) and 
LASER GENE software (DN AST AR). Polynucleotide and polypeptide sequence alignments are 

20 generated Bsing default parameters specified by ihe CLUSTAL algorithm as incorporated into the 
MEG ALIGN rnurtiscquence alignment program (DNASTAR), which also calculates the percent 
identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDN A and fuD length sequences and provides applicable descriptions, references, and threshold 

25 parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions thereof, the third column presents appropriate references, all of 
which are Incorporated by reference herein in their entirety, and the fourth column prese n ts, where 
applicable, the scores, probability values, and other parameters osed to evaluate the strength of a match 
between two sequences (the higher the score or the lower the probability value, the greater the Identity 

30 between two sequences). 

The programs described above for the assembly and analysis of roll length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:22-42. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies are described In Table 4, column 4. 
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e rrauaily identified by running the Genua o gene 
identification program against public genomic sequence databases (eg., gbpri and gbhtg). Censcan is a 
general-purpose gene joenti ftrat io r i program which analyzes genomic DN A sequences from a variety of 
organisms (See Burge. C and S. KarUn (1997) J. Mol. Bid. 268:78-94, and Burge, C. and S. Kariin 
(1998) Curr. Opin Struct. Bio). 8346-354). The program concatenates predicted exons to form an 
ass e m b le d cDNA sequence e xt e nd i n g from a methionine to a stopecdon. The output of Censcan is a 
FASTA database of polynucleotide and polypeptide sequences- The maximum range of sequence for 
Genscan to analyze at once was set to 30 kb. To determine which of these Censcan predicted cDN A 
sequences encode G-protdn coupled receptors, the encoded polypeptides were analyzed by querying 
against PFAM tnodets for G-protdn coupled receptors. Potential G-protdn coupled receptors were also 
identified by homology to lncyte cDNA sequences that bad been annotated as G-protdn coupled 

d Geriscan-prerfiaed sequences were then compared by BLAST analysis to the 



genpept and gbpri public databases. Where necessary, the Genscan-predicted s equ e nc es were then 
edited by comparison to the top BLAST hit from genpept to correct errors in the s equenc e predicted by 
. BLAST analysis was also used to rind any lncyte cDNA or 
s providing evidence for transcription. 
When lncyte cDNA coverage was available, this information was used to correct or confirm the 
Censcan predicted sequence. FuD length polynucleotide s equen ce s were obtained by assembling 
Genscan-pretfictcd coding sequences with lncyte cDNA sequences and/or public cDNA sequences using 
the assembly process described in Exa m ple m. Alternatively, fun length polynucleotide sequences wen 
derived entirely from edited or unedited Genscan-predicted coding sequences. 
V. Assembly of Genomic Sequence Data with cDNA Sequence Data 



23 Partial cDNA ft 

program described in Example TV. Partial cDNAs assembled as described In Example III were mapped 
to genomic DNA and parsed into clusters containing related cDNAs and Genscan ex or predictions from 
one or more ge nom i c sequences. Each cluster was analyzed using an algorithm based on graph theory 
g to integrate cDNA and genomic information, generating possible splice 



S e qu e nce intervals in which the entire length of the interval was present on more than one sequence in 
the cluster were ide n tified, and intervals thus identified were co n s i de r ed to be equivalent by transitivity. 
For example, if an Interval was present on a cDN A and two genomic s eq ue nce s, then all three Intervals 
were considered to be equivalent. This process allows unrelated but co n secutive genomic seq uences to 
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be brought together, bridged by cDNA sequence. Intervals ihus identified were then "Hitched" together 
by the sutching algorithm In the order that they appear along their parent s eq ue n ces to generate the 
longest possible sequence, as well as s equ e n ce variants. 1 infcages between intervals which proceed 
along one type of parent seqnenoe (cDN A to cDNA or g en o m i c segue nre to genomic sequence) were 
3 given preference over linkages which change parent type (cDN A to genomic sequence). The resultant 
stitched sequences were translated and compared by BLAST analysis to the genpept and gbpri public 
databases. Incorrect exoos predicted by Genscan were corrected by comparison to the top BLAST hit 
from genpept. Sequences were further ex t e n d ed with additional cDNA sequences, or by inspection of 
ge n o m ic DNA, when ne cessaiy. 

10 "StretcheJ" Sequences 

Partial DNA sequences were extended to fun length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example m were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases using 
the BLAST program. The nearest GenBank protein hornolog was then compared by BLAST analysis 

13 to either lncyte cDNA sequences or GenScan exon predicted sequences described In Example IV. A 
chimeric protein was generated by using the resultant high-scoring s egmen t pairs (HSPs) to map the 
translated sequences onto the GenBank protein hornolog. Insertions or deletions may occur in the 
chimeric protein with respect to the original GenBank protein bomolog. The GenBank prolan hornolog. ' 
the chimeric protein, or both were used as probes to search for homologous genomic sequences from the 

20 public human genome databases. Partial DNA sequences were therefore "stretched" or extended by the 
addition of hctrookigous geoonac sequences. The resultant stretched sequences were examined to 

VL Chromosomal Mapping of GCREC Encoding Poly node© tides 

The sequences which were used to assemble SEQ ID NO: 22-42 were compared with 

23 sequences from the lncyte L1FESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:22-42 were assembled into dusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public re so ur c es such as the Stanford Human Genome Center (SHGQ. Whitehead Institute for 

30 Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a duster resulted in the assignment 
of all sequences of that duster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, or human chromosomes. The map 
position of an Interval, in centi Morgans, is measured relative to the terminus of the chromosome's p~ 
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arm. (The centi Morgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to I megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) The c.M 
distances are based on genetic mar ken mapped by Genethon which provide boundaries for radiation 
5 hybrid markers whose sequences were included in each of the dusters. Human genome maps and 
other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(http7/www jjcbijilm.nih.gov/genemapO, can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VII. Analysis of Polynucleotide Expression 
10 Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

and involves the hybridization of a labeled nudeotide sequence to a membrane on which RNAs from a 
particular cdl type or tissue have been bound. (See, eg.. Sambrook, .supra , ch. 7; Ausubd (1995) 
supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
15 molecules in cDNA databases such as GenBank or LIFESEQ (lncyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search Is tbe product score, which is defined as: 

20 BLAST Score x Percent Identity 

5 x minimum (lengtb(Seq IX lenguXSeq, 2)) 

The product score takes into account both the d eg r ee of similarity between two sequences and the length 
of the sequence match. The product score is a ncxTDallzed value between 0 and 1 00, and is calculated 

23 as follows: the BLAST score Is multiplied by the percent nudeotide identity and the product Is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
assigning a score of +5 for every base that matches in a high-scoring segment pah (HSP). and -4 for 
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there to more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 

30 product score represents a balance between fractional overlap and quality in a BLAST alignment- For 
example, a product score of 1 00 U produced only for 1 00* identity over the entire length of the snorter 
of the two sequences bdng compared. A product score of 70 is produced either by 100% identity and 
70* overlap at one end. or by 88% kJenUty and 100* overlap at the other. A product score of 50 is 
produced either by 1 00* identity and 30* overlap at one end, or 79* identity and 100% overlap. 
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Alternatively, polynudeotide sequences encoding GCREC are analyzed with respect to the 
tissue sources from which they were derived. For example, some full length .sequences are assembled, 
at least in part, with overlapping lncyte cDNA sequences (see Example III)- Each cDNA sequence is 
derived from a cDNA library constructed from a human tissue. Each human tissue to classified into one 
5 of the following organ/tissue categories: cardiovascular system; connective tissue; digestive system; 
embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cdls; 
hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; respiratory 
system; sense organs; skin; stomatognatrdc system; undasrified/inrxed; or urinary tract. Tbe number of 
libraries in each category to counted and divided by the total number of libraries across all categories. 

10 Similarly, each human tissue Is classified into one of tbe following disease/condition categories: cancer, 
cdl line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, and other, and the 
number of libraries in each category to counted and cfivided by the total number of libraries across *D 
categories. The resulting percentages reflect the tissue- and disease-specific expression of cDNA 
encoding GCREC. cDNA sequences and cDNA library/tissue information are found in the LIFESEQ 

15 GOLD database (lncyte Genomics, Palo Alto CA). 

VIII. Extension of GCREC Encoding Polj nucleotides 

FuD length pdynodeocide sequences wore also produced by extension of an appropriate 
fragment of the fun length naolccule using oligonucleotide primers designed from this fragment. One 
primer was syrahesized to initiate 5' extension of the known fragment, and the other primer was 

20 synthesized to initiate 3' extension of the known fragment. The tartix! primers were designed using 
OLIGO4.06 software (National Biosciences), or another ap p r opr i a t e program, to be about 22 to 30 
nucleotides In length, to have a GC content of about 50* or more, and to anneal to the target sequence 
at ternperatures of about 68 *C to about 72*C Any stretch of endeotides which would result In hairpin 
structures and primer -primer dimerizations was avoided. 

23 Selected human cDNA libraries were used to extend the sequence. If more lhac one extension 

was necessary or desired. additional or nested sets of primers woe designed. 

High Oddity ampufication was obtained by PCR using methods well known in the art. PCR 
was performed In 96: well plxtes using the PTC-200 thermal cycler 0VU Research, Inc.). Trie reaction 
mix contained DNA template, 200 mnol of each primer, reaction boner containing Mg», (NHJjSO,. 

30 and 2-mereaptoethand, Taq DNA polymerase (Ameraham Pharmada Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (StratageneX "tab the following parameters for primer 
pair PCI A and PCI B: Step 1: 94'C. 3 min; Step 2; 94*C 15 sec; Step 3: 60*C, 1 ram; Step 4: 68*C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68*C, 3 min; Step 7: storage at 4*C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94*C, 3 min; Step 2: 
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j^j^^-qjCiUsj 20 limes; 



94 *C. 15 sec; Step 3: 57'C 1 min; Slq> 4: 68 *C. 2 min; Step 3: Steps 2 
Step 6: 68"C. 5 min; Step 7: storage tt 4*C 

Tbe crmrrmrmjoa of DNA in etch well was determined by oTtfpensing 100 pi PICOGREEN 
qi m i m i at ta n reagent (0.25* (vfv) PICOGREEN; Molecular Probes. Eugene OR) dissolved to IX TE 
and OJ ul of onfihflo) PCR product too each wdl of an opaque fhioriraeter plate (Corning Casta*. 
Acton MAX allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan 11 
(Labsystens Oy, Helsinki, Finland) lo measure the fhjorescence of the sample and to Quantify the 
concentration of DNA. a 5 vi to 10 pi aliquot of the reaction mixture was analyzed by ekrtrophoresis 
on a 1 % agarose gel to determine which reactions were successful In extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 3*4- well plates, 
digested widi CMJ1 cholera vims endormclray (Molecular Biology Research. Madison WI), and 
sonicated or sheared prior to retigatlon Into pUC 18 vector (Amersham Pharmacia Biotech). For 

n (0.6 to 0.8%) agarose 
1, and agar digested with Agar ACE (Promega). Extended clones were 
retlgatcd using T4 ligase (New England Blolabs. Beverly MA) Into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratflgene) to fill-in restriction site overhangs, 

n antibiotic-containing 
e picked and cultured overnight at 37*C in 384-wc0 plates in LB/2x 



The cells were tysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pro DNA polymerase (Stratagem) wtth tbe following parameters: Step 1: 
94'C.3min;Step2:94*C.15sec;Step3:60*C. I mm; Step 4: 72*C. 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72*C 5 min; Step 7: storage at 4*C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were rcamplifled 
using the ttme condjUora as described above. Samples were diluted with 20% d3methysotfoxide(l:2, 
v/v), and se qu e n c ed using DYENAMIC energy transfer sequencing primers and (he DYENAMTC 
DIRECT kit (Amersham Pharmacia Biotech) or the AB J PRISM B1GDYE Terminator cycle 



30 are used to obtain 5' regulatory sequences using the above procedure along 
designed for such extension, and an appropriate genomic library. 
IX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:22-42 are 
DN As, or mRNAs. Although the labeling of oligonucleotides, consisting 



een cDNAs, 
of about 20 base pairs, is 



Oligonucleotides are designed using suft>oMhe-an software such as OL1CO 4.06 software (National 
^ Biosciences) and l a b ele d by co nt n hto g 50 pmol of each oligomer, 250 j/Q of hf" , *P) adenosine 
triphosphate (Amersham Pharmacia Biot e ch), and T4 polynucleotide k in ase (DuPont NEN, Boston 
5 MA). The labeled otigocacleockfcs are tubstantiafly purifled using a SEPHADEX 0-25 snporflnc size 
exclusion do Iran bead column (Amersham Pharmacia Biotech). An a l iquot containing 10* counts per 
minute of the la b eled probe is used in a typical mon bi ane-based bybrfcflzation analysis of human 
genomic DNA digested with one of the following errtocucleases: Ase 1. BgJ n. Eco RI, Pst I. Xba I. or 
Pvu n (DuPont NENX 

10 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus. Schleicher A SchucD, Durham NH). Hybridization Is carried out for 16 
hours at 40*C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example. 0.1 x saline sodium citrate and 0.5% sodium dndecyl sulfate. 

y or an alternative imaging means and 



photolithography, piezoelectri c printing (ink-Jet printing. See. eg., B aldeschwd ler, IJBaaV X mechanical 
uBCiosponing technologies, and derivatives thereof. The substrate in each of the afuuuenlioned 

20 technologies should be uniform and solid wfah a non-porous surface fSchena (19991 sopn\ Suggested 
substrates include silicon, sdca. glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and Dnk ekinenu to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be . 
produced using available methods and machines wcD known to those of ordinary skSl in (he art and nay 

25 contain any appropriate number of ele m ents. (See, eg.. Schcna, M. ct al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and I. Hodgson ( 1 998) Nat BJctechnoL 
16:27-31.) 



ts of the microarray. Fragments or digomen suitable for hybridization can be 
I using software well known in the art such as LASER GENE software (DNASTAR). The array 
s are hybridized with polyn ucl eotides in a biological sample. The polynucleotides in tbe 



biological sample are conjugated to a fluorescent labd or other molecular tag for ease of detection. 
After hybridization, nonbybrkuzed nucleotides from tbe biological sample are removed.' and a 
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desorbtion and mass spectrometry may be used Tor detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the micrcorray may be assessed. In one embodiment, microarray preparation and usage is described in 
detail below. 

Total RNA is isolated from tissue samples using the guanldintum ttuocyanate method and 
poiy(A)* RNA is purified using the oligcKdT) cellulose method. Each poly (A)* RNA sample is 
reverse transcribed using MMLV reverse-transcrtptase, 0.05 pg/ui oligcKdT) primer (2 1 mer), IX first 
strand buffer. 0.03 unitsAd RNnse inhibitor. 500 uM dATP. 500 uM dCTP. 500 uM dTTP. 40 pM 

10 dCTP. 40 uM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poty(A)* RNA with 
GEM BRIGHT kits Oocyte). Specific control poly(A)* RNAs are synthesized by in vitro ^inscription 
from non-coding yeast genomic DNA. After incubation at 37*C for 2 to, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5 M sodium hydroxide and 

15 incubated for 20 minutes at 85*C to the stop the react! on and degrade the RNA. Samples arc purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH). Palo Alto CA) and after combining, both reaction samples are etbanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
ihen dried to completion using a SpeedVAC (Savant Instruments Inc. Hoi brook NY) and 

20 resuspended In 14 ul 5X SSC/0.2% SDS. 
MicmarraT Preparation 

Sequences of the present invention are used to generate array dements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA Inserts. PCR amplification uses 
primers complementary to (he vector t rqnenm flanking the cDNA insert. Array elements are 

25 amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
ug. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 

Purified array el e me n ts are Immobilized on polymer-coated glass slides. Class microscope 
slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
30 washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 

Scientific Products Corporation (VWR), West Chester PAX washed extensively in a i titled water, and 
coated with 0.05% aminopropyl sOane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10*C 
oven, 

■ Array elements are applied to the coaled glass substrate using a procedure described in US 
35 Patent No 5,807.522 . incorporated herein by reference, I uJ of the array element DNA, at an average 

70 , 
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concentration of 100 ng/ul, is loaded Into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide, 

Microarrays are UV-crosslinked using a STRATAL1NKER UV-crossllnker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
5 Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (TropJx, Inc. Bedford MA) for 30 minutes at 60* C followed by washes in 
0.2% SDS and distilled water as before. 
Hvbridhtarlon 

Hybridization reactions contain 9 ul of sample mixture consisting of 0.2 ug each of Cy3 and 
10 Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

mixture is heated to 65*C for 5 minutes and is aliquotcd onto the microarray surface and covered wtth 
an 1.8 cm* coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% h um idity internally by the addition of 
140 ul of 5X SSC In a corner of the chamber. The chamber containing the arrays is incubated Tor 
15 about 6.5 hours at 60* C The arrays are washed for 10 min at 45*C in a first wash buffer (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45 *C in a second wash buffer (0.1X SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes an detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc. Santa Clara CA) capable of generating spectral tines 
20 at 488 nm for excitation of Cy3 and at 632 nm for excitation of CyS. The excitation laser light Is 
focused on the array using a 20X microscope objective (Nikon, Inc. Melville NY). The slide • 
containing the array to pi aced on a compu ler-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 

j 25 In two separate scans, a mixed gas multiline laser excites the two fhjoroptoures sequentially. 

Emitted light is split, based on wavelength, into two photomultipbcr tube detectors (PMT R1477, 
Hamamatxn Ptxxonics Systems, Bridgewater NJ) corresponding to the two nuorophorea. Appropriate 
filters positioned between the array and the prx*omnltiplier tubes are used to filter the signals. The 
emission maxima of the froorophorcs used are 565 nm for Cy3 and 650 nm for Cy5. Each array ts 

30 typically scanned twice, one scan per fluorophore using the approp ri ate filters at the laser source, 
although tbe apparatus to capable of recording the spectra from both ftoorophorcs simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to (he sample mixture at a known concentration, A specific location on 
the array contains a co m ple m e n tary DNA s eq ue nc e, allowing the intensity of (he signal at (hat 

33 location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples 
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from different sources (eg., representing tea and control cells), 
fluccopbore, are hybridized to a single array for die purpose of identifying genes that are differentially 
expressed, Ibe calibration is done by labeling samples of the calibrating cDNA wtib (be two 
fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultipiier tube is digitized using a 12-tril RT1-835H anslog-io-dlgiial 
(A/D) conversion board (Analog Devices, Inc. Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-cotor 0»nsfonnailon to a pseudocolor scale ranging from blue (tow signal) to red (high 
signal). The data Uaa»»nalyzed quantitatively. Where two different fhicrophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fruorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
• is centered in each element of the grid. The fluorescence signal within each dement is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis Is the GEMTOOLS gene expression analysis program Oocyte). 

XI. Complementary Polynucleotides 

Sequences complementary to the GCREC-cncoding sequences, or any parts thereof . are used to 
detect, decrease, or inhibit expression of naturally occurring GCREC. Although use of oUgonncleotide* 
comprising from about 15 lo 30 base pairs is described, essentially the same procedure is used with 
smaDcr or with larger sequence fragments. Appropriate oUgrjrJUdeoddCS are designed using QUGO 
4.06 software (National Biosciences) and the coding sequence of GCREC. To inhibit transcription, a 
cornplernerflarydigcro^^ 

promoter Wrsliruj to ihe coding sequence To inhibit translation, a complementary oligonudcotide is 
designed to prevent ribosomal binding to the GCREC-encocfing transcript. 

XII. Expression of GCREC 

. Expression and purification of GCREC is achieved using bacterial or virus-based expression 
systems. For expression of GCREC in bacteria. cDNA is subcloned into an appropriate vector 
contalmng an antibiotic resistance gene and an inducible promoter lhai directs high levels of cDNA 
transcription. Examples of such promoters Include, but are not limited to, the trp-lac <wc) hybrid 
promoter and the T5 or T7 bacteriophage promoter m conjunction with the toe operator regulatory 
rtrmrrn Recombinant vectors are Uansfcrmed into suitable bacterial basts, eg.. BL21(DE3). 
Antibiotic resistant bacteria express GCREC upon induction with tsopropyl beta-D- 
ihiog^aaopyranoside (IPTG). Expression of GCREC in eokaryotic cells is achieved by infecting 
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(AcMNPV), commonly known as bacutovirus. The nocesserajal pcryhedrin gene of bacnlovirus is 
replaced with cDNA enccdmg GCREC by either homologous recornbination or bacterial-mediated ' 
transposition involving transfer ptasrmd rnirnnrrtritrs. Wal infectivtry is maintained and the strong 
pdvhedrin promoter drives high teveb of cDNA transcription. Reccrnbinanl bacutovirus is used to 
infect Spodootera frucrperda (Sf9) insect eclh in most cases, or human hepatocytes, in some cases. 
Infection of the laiur requires additional genetic modifications to boculorirus. (See Engelhard, E.K. et 
al (1994) Proc Naff Acad. Sd. USA 91:3224-3227; Sancfig. V. etal. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In roost expression systems, GCREC is synthesized as a fusion protein with, eg., glutathione 
S-transferasc (GST) or a peptide epitope tag. such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cefl lysates. GST. a 26-kOodalton 
enzyme from Schistosoma iapodqim. bbMh the, poriiication ft"*™ p™"*™ 
glutathione under conditions that maintain protein activity and antigenicity (Anaersham Pharmacia 
Biotech). Following purification, the GST exxefy can be pnxeoryticaDy deaved from GCREC at 
specifically engineered sites. FLAG, an 8 -amino acid peptide, enables imrnunoafTmity p 



using comrnerdaDy available monoclonal and polyclonal anti-FLAC antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive MsticSne residues, enabies purification on metal-cheiale resins 
(Q1AGEN). Methods for protein expression and purification are discussed in Ausubd (1995. supra. 
ch. 10 and 16). Purified GCREC obtained by these methods can be used directly in the assays shown in 
Examples XVI, XVII, and XVIII, where applicable 
XIII. Functional Assays 

GCREC function is assessed by expressing the sequences encoding GCREC al physiologically 
elevated levels in mammalian ceil culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter thai drives high levels of cDNA expression. Vect ors of choice 
include FCMV SPORT (Life Technologies) and PCR3. 1 (Invrtrogen, Carlsbad CAX both of which 
contain the cytomegalovirus promoter. 5-10 ng of recombinant vector are transiently Cransrected into a 
human cdl line, for example, an endothelial or herrBtrxxieiic cefl line, using either liposome 
formulations or dectropcr ation. 1-2 of an additional pUsmid containing sequences encoding a 
marker protein are co-transfectod Expression of a marker protdn provides a means to distinguish 
transfectcd cefls from ncntransfected ceils and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice tochide, eg.. Green fluorescent Protein (GFP; 
Clomech). CD64. or a CD64-GFP fusion protein. Flow cytometry (PCM), an automated, laser optics- 
based technique, is used to identify transected cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic stale of the cefls and other cellular properties. FCM detects and quantifies the uptake of 
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fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes In nuclear DNA content as measured by staining of DNA with propkiium iodide; changes in 
cdl size and granularity as measured by forward light scatter and 90 degree side tight scatter; down- 
regulation of DNA synthesis as measured by decrease in bTOmodeaxyuridJne uptake; alterations in 
5 expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; 
and alterations in plasma membrane composition as measured by the binding of ffaarcscein-ccnjugaied 
Annex in V protdn to the cdl surface. Methods In flow cytometry are discussed in Ormerod. M.G. 
(1994) Fk)w Cytometry. Oxford, New York NY. 

The influence of GCREC on gene expression can be assessed using highly purified populations 

10 of cells transfected with sequences encoding GCREC aixJ dlher CD64 or CD64-CFP. CD64 and 
CD64-GFP are expressed on the surface of trensfectcd cdte and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cefls are efficiently separated from noncransfected cells using 
magnetic beads coaled with either human IgO or antibody against CD64 (DYNAL, Lake Success NY). 
mRNA can be purified from ihe cefls using methods well known by those of skill In the an. Expression 

15 of mRNA encoding GCREC and other genes of interest can be analyzed by northern analysis or 
microarray techniques. 

XIV. Production of GCREC Specific Antibodies 

GCREC substantially purified using pdyacryJamkJe gd eleclrcphor esis (PAGE; see, eg., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495). or other purification techniques, is used to 

20 Immunize rabbits and lo produce antibodies using standard protocols. 

Alternatively, the GCREC amino add sequence is analyzed using LASER GENE software 
(DNASTAR) to determine regions of high inununogentcity. and a corresponding obgopeptlde is 
synthesized and used to raise antibodies by means known to those of skin in the art. Methods for 
sdection of appropriate epitopes, such as those near the C-terminus or In hydrophillc regions are wen 

25 described to the art (See eg.. Ausubd. 1995. supra, ch 111 

Typically, oligopeptides of abort 15 residues 431A 
peptide synthesizer (Applied Blcsystems) using FMOC cherdstry and coupled to KLH (Slgma- 
AWrich, St. Louis MO) by reaction with N-nuudmidoc«izoy1-N-hy^^ ester (MBS) to 

increase immunogenidty. (See, eg., Ausubd. 1995, msJt ) Rabbits are immunized with (he 

30 oligopeptide- KLH complex in complete Freundx adjuvant. Resulting antlscra are tested for antlpcptide 
and ami- GCREC activity by, for example, binding the peptide or GCREC to a substrate, blocking with 
1% BS A. reacting with rabbit antlscra, washing, and reacting with radio- iodinaicd goat anti-rabbit IgG. 

XV. Purification of Naturally Occurring GCREC Using Specific A ntibodks 
Naturally occurring or recombinant GCREC is substantially purified by Inununoaffinity 
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chromatography using antibodies specific for GCREC. An iromuncafflnity column is constructed by 
covalently coupling antl-GCREC antibody lo an activated chromatographic resin, such as 
CNBr- activated SEPH AROSE (Amersham Pharmacia Biotech). After ihe coupling, the resin is 
blocked and washed according to the manufacturers instructions. 
5 Media containing GCREC are p a sse d over the inununoaffinity column, and Che column is 

washed under conditions thai allow the p r efe rential absorbance of GCREC (eg., high ionic strength 
buffers in the presence of detergent). The column is dutcd under conditions that disrupt 
ami body/ GCREC binding (eg., a buffer of pH 2 to pH 3, or a high concentration of a uhaotrope, such 
as urea or thkxyanate too), and GCREC is collected. 

10 XVI. Identification of Molecules Which Interact with GCREC 

Molecules which interact with GCREC may include agonists and antagonists, as well as 
molecules involved in signal transduction, such as G proteins. GCREC. or a fragment thereof, is 
labeled with '"I Botton-Humer reagent. (See, eg.. Bolton A.E. and W.M. Hunter (1973) Biochem J. 
133:529-539.) A fragment of GCREC includes, for example, a fragment comprising one or more of the 

15 three extracellular loops, rhe extracellular N-ternrinal region, or the third intracellular loop. Candidate, 
molecules previously arrayed in the wells of a multi-wen plate are incubated with the labeled GCREC, 
washed, and any wdls with labeled GCREC complex are assayed Data obtained using different 
concentrations of GCREC are used lo calculate values for Ihe number, affinity, and association of 
GCREC with the candidate ligand molecules. 

20 Al tematively, molecules interacting with GCREC are analyzed using the yeast two-hybrid 

system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commerdafly 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Ckmtech). 
GCREC may also be used in the PATH CALLING process (CuraGen Corp., New Haven CT) which 
employs the yeast two-hybrid system In a high-throughput manner to determine all interactions between 

25 the protdns encoded by two large libraries of genes (Nindabalan, K. ct al (2000) U.S. Patent No 
6.057.101). 

Potential GCREC agonists or antagonists may be tested (or activation or inhibition of GCREC 
receptor activity using rhe assays described in sections XVII and XVIII. Candidate molecules may be 
sel e c t ed from known GPCR agonists or antagonists, peptid e libraries, or combin atorial chemical 
30 libraries. 

Methods for delecting interactions of GCREC with intracellular signal transduction molecules 
such as G proteins are based on the premise that internal segments or cytoplasmic domains from an 
orphan G protdn-coupled seven transmembrane receptor may be exchanged with the analogous 
domains of a known G protein-coupled seven transmembrane receptor and used to identify the G- 
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n !ece?MW5mains (1 



proteins and downstream signaling pathways activated by the orphan receJBKEmains (Kobilka. 
B.K. ct al. (1988) Science 240: 13 10-1 31 6). In «n analogous fashion, domain of the orphan receptor 
may be cloned as a portion of t fusion protein and used In binding assays to demonstrate interactions ■ 
with specific C proteins. Studies have shown that the third intracellular loop of G proteio-couplcd 
seven transmembrane receptors is important lor G protein interaction and signal transduction 
(Conklin. B.R. et al. ( 1993) CeJl 73:631-641). For example, the DNA fragment corresponding to the 
third intracellular loop of GCREC may be amplified by the polymerase chain reaction (PCR) and 
subekmed into a fusion vector sucb as pCEX (Pharmacia Biotech). The construa is transformed into 
an appropriate bacterial host, induced, and the fusion protein is purified from the cell lysate by 
ghj*atlndne-Sepharose4B (Pharmacia Biotech) affinity chromatography. 

For In vitro binding assays, cell extracts containing G proteins are prepared by extraction with 
50 mM Tris, pH 7.*, 1 mM EGTA. 5 mM MgCI „ 20 mM CHAPS. 2» glycerol. 10 pg of both 
aprotinin and leupeptin, and 20 ul of 50 mM r^vlmetirylsulfonyl fluoride. The lysate is incubated 
on ice for 45 min with constant stirring, centrifuged at 23.000 g ror 15 rain at 4fA and the 
supematam is collected. 750 ug of cell extract Is incubated with glutathione S-transferase (GST) 
fusion protein beads far 2 h at 4*C The GST beads are washed five times with phosphale-buffered 
saline. Bound G sobunils are detected by ("P)ADP-ribosyiaiion with pertussis or cholera toxins. The 
reactions are terminated by the addition of SOS sample buffer (4.6% (w/v) SOS, 10% (v/v) 
fr-mercarjtoethanol. 20* (w/v) glycerol. 95.2 mM Trts-HCl. pH 6.8, 0.0 1 * (w/v) brornphenol blue). 
The [^PJADP-labeled proteins are separated on 10* SOS-PAGE gels, and autoradJographed. The 
separated proteins in these gels are transferred to nitrocelrulose paper, blocked wittt blotto (5* nonfat 
dried milk, 50 mM Tris- HO (pH 8.0). 2 mMCaCl„80 mM NaO, 0.02* NaN„ and 05* Nonidet 
P-40) for 1 hour at room temperature, followed by Incubation for 1.5 hours witb Go subtype selective 
antibodies (1 300; Calbiochem-Novablochem). After three washes, blots are Incubated with 
horseradish peroxidase (HMO-conjugated goal anti-rabbit Immunoglobulin (1 :2000. Capnel. 
Westchester PA) and visualized by the chemihumncsxxnee^based ECL method (Amcrsharo Corp.). 
XVI 1. Demonstration of GCREC Activity 

An assay for GCREC activity measures the expression of GCREC on the cef) surface. cDNA 
encoding GCREC is trans fectcd into tn appropriate mammalian ceH fine. Cell surface proteins are 
labeled with Motto as described (de la Fucnte. M A. et al. (1997) Blood 90:2398-2405). 
Immu ooprccipitations are performed using GCREC -specific antibodies, and i mmu nccr ecipitated 
samples axe analyzed using sodium dodecyl sulfate poly aery I amide gel electrophoresis (SDS-PAGE) 
and immunoblotting techniques. The ratio of labeled irrmwnoprccJpitant to unlabeled 

roportional to the amount of GCREC expressed on the cell surface. 
In the alternative, an assay for GCREC activity is based on a prototypical assay for 
76 
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. This assay n 

synthesis in Swiss c 
to Quiescent 3T3 o 

transfected cells are then incubated in the presence of I'H (thymidine, a radioactive DNA precursor 
molecule. Varying amounts of GCREC ligand are then added to the cultured cells. Incorporation of 
[*H Jibyrnkflne into tcJd-predpttabte DNA b measured over an appropriate time interval using a 
radioisotope counter, and the amount incorporated Is directly proportional to the amount of oewty 
-synthesized DNA A linear dose-response curve over at least a hundred- fold GCREC ligand 
concen tra tion range Is mtBcative of receptor activity. One unh of activity per mflUllter b defined as the 
concentration of GCREC producing a 50* response level, where 100* represents m»Mn«t 
incorporation of (*Hphymir£ne into actd-preciptiable DNA (McKay, J. and I. Leigh, eds. 0993) 
Grgwtti Factors; A Poetical AlTPTqach. Oxford University Press, New York NY. p. 73.) 

In a further alternative, the assay for GCREC activity is based upon the ability of GPCR 
family proteins to modulate G protein-activated second messenger signal transduction pathways (eg.. 
CAMP; Churns. P. et al, (1998) J. Biol. Chem. 273:4990-4996). A plasmid encoding full length 
GCREC Is transfected into a mammalian cell line (eg.. Chinese hamster ovary (CHO) or human 
embryonic kidney (HEK-293) cell lines) using methods well-known in the an. Transfected cells are 
grown In 12- wen trays in culture medium for 48 hours, then the culture medium is discarded, and the 
attached cells are gently washed with PBS. The ceQs are then Incubated in culture medium with or 
without ligand for 30 minutes, then the medium is removed and certs lysed by treatment with 1 M 
perchloric add. The cAMP levels in the lysate are measured by radioimmunoassay using methods 
well-known in the art Changes in the leveb of cAMP in the lysate from ceils exposed to ligand 
compared to those without ligand are proportional to the amount of GCREC present In die transfected 



25 To measure changes In inositol phosphate levels, the cells are grown in 24- well plates 

containing IxlO 5 certs/wef] and incubated with inositol -free media and (^myoinositol, 2 uCltoefl. for 
48 nr. The culture medhora Is removed, and the cells washed with buffer containing 10 mM Lid 
followed by addition of ligand. The reaction is stopped by addition of perchloric add Inositol 
phosphates are extracted and separated on Dowex AG 1-X8 (BJo-R*d) anton exchange resin, and the 

30 total labeled inositol phosphates counted by liquid scinUDstion. Changes in the levels of labeled 

inositol phosphate from cells exposed to ligand compared to those without ligand are proportional to 
the amount of GCREC present in the transfected cells. 
XVIII. Identification of GCREC Uganda 

GCREC is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK 



(Human Embryonic Kidney) 293 which have a good history of GPCR expression and which contain ■ 
wide range of G-protdra allowing for functional coupling of the expressed GCREC to downstream 
effectors. The transformed cells are assayed for activation of the expressed receptors in the presence of 
ca nd ida te ligands. Activity is measured by changes In intracellular second messengers, such as cyclic 
AMP or Co**. These may be measured directly using standard methods well known in the art. or by the 
use of reporter gene assays in which a luminescent protein (eg. firefly lucif erase or green fluorescent 
protdn) is under the transcriptional control of a promoter responsive to the stimulation of protein kinase 
C by me activated receptor (Mftligan. G. et al. (1996) Trends Pharmacol. Sd. 17:235-237). Assay 
technologies are available for both of these second messenger systems to aDow high throughput readout 
in multi-wen plate format, such as (he adenyryl cyclase activation FlashPlate Assay (NEN Life Sciences 
Products), or fluorescent Ca** indicators such as Fluo-4 AM (Molecular Probes) in combination with 
the RJPR fluorimetrk: plate reading system (Molecular Devices). In cases where the physiologically 
relevant second me ss enger pathway is not known, GCREC may be ccexpressed with the O -proteins 
G^y,, which have been demonstrated to couple to a wide range of G-proteins (Orfermanm. S. and M.I. 
Simon (1 995) J. Bka. Chem. 270:1 5 175- 15 1 80), in order to funnel the signal transduction of the 
GCREC through a pathway involving phosphoiipase C and Ca** mobilization. Alternatively. GCREC 
may be expressed in engineered yeast systems which lack endogenous GPCRs, thus providing the 
advantage of a null background for GCREC activation screening. These yeast systems substitute a 
human GPCR and G« protein for the corr esp on di ng componen ts of the endogenous yeast pheromone 
receptor pathway. Downstream signaling pathways are also modified so that the normal yeast response 
to the signal is converted to positive growth on selective media or to reporter gene expression (Broach, 
J.R. and J. Tbomcr (1996) Nature 384 (supp.):14-16). The re c ep to r s are screened against putative 
Uganda including known GPCR Ugands and other naturally occurring btaactfvc molecules. Biological 
extracts from tissues, biological fluids and cell si 




fields are intended to be within the scope of the following 
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What is claimed is: 

1 . An isolated polypeptide comprising an amino add sequence se lect ed from the group 
consisting of. 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-21, 

b) a naturally occurring amino acid sequence having at least 90% s eq ue nc e identity to an 
amino add sequence selected from the group consisting of SEQ ID NO:l-2 1, 

c) a biologically active fragment or an amino add sequence selected from toe group 
consisting of SEQ ID NO:l-21. and 

to d) an immunogenic fragment of an amino add sequence selected from the group consisting 

of SEQ ID NO: 1-21. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

21. 

IS 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 sdected from the group consisting of SEQ ID 

NO:22-«2. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a reco mb i n ant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of daim 6. 

30 9. A method for producing a polypeptide of claim 1 . the method comprising: 

a) cuJ coring a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a potyntKlcotide encoding the polypeptide of daim 
Land 

35 b) recovering the polypeptide so expressed. 



102 



103 



10. An isolated antibody which specifically binds 10 a 



• 

poIyp^HBTof c 



11. Ad isolated polynucleotide comprising • polynucleotide sequence selected from the 
8 of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO.22-42. 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 22 -4 2. 

c) a polynucleotide sequence complementary to a). 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a>d). 

12. Ao isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
11. 



15 13. A method for delecting a target polynucleotide in a sample, said target polynucleotide 

having a sequence of a polynucleotide of daim 1 1 . the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 

20 complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and. optionally, if 



14. A method of daira 1 3. wherein the probe comprises at least 60 contiguous nucleotides. 

;. 25 

15. A method for detecting a target polynodeoti 
having » sequ enc e of a polynucleotide of daim 1 1, the n 

a) amplifying said target polynucleotid 
amplification, and 

30 b) detecting the presence or absence of said amplified target polynucleotide or fragment 

]y, if present, the amount thereof. 



in a 



16. A composition c 



g an effective amount of a polypeptide of daim 1 and a 
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17. A composition of claim 16. wherein the polypeptide comprises an amino add sequence 
g of SEQ ID NO: 1-2 1. 



18. A method for treating a disease or condition associated with decreased expression of 
5 functional GCREC. comprising ad m in i stering to a patient in need of such treatment the composition 
16. 



19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
daim 1. the method comprising: 

a) exposing a sample comprising a polypeptide of daim I to a a 

b) detecting agonist activity in the sample. 



20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutlcally acceptable exdpient- 



21. A method for treating a disease or condition associated with decreased expression of 
functional GCREC. comprising administering to a patient in need of such trearme 
daim 20. 



20 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

Of daiXD I, the fr»rpfifv*i"g' 

a) exposing a sample comp ri sing a polypeptide of daim 1 to a compound, and 

b) detecting antagonist activity In the sample. 



) 25 



23. A 



bya 



24. A method for treating a disease or condition associated with overexpression of functional 
GCREC, comprising administering to a patient in need of such treatment a composition of claim 23. 



25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of . 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 



b) detecting binding of the polypeptide of daim 1 to the test compound, thereby identifying a 
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compound thai specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound thai modulates the activity of Ihe polypeptide of 



a) co m bi n i n g the polypeptide of daim 1 with at least one test compound under conditions 
permissive- for the activity of the polypeptide of daim 1, 

b) assessing the activity of the polypeptide of daim 1 In the presence of the lest compound, at 

c) comparing the activity of the polypeptide of claim I in the presence of the test compound 
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change it 
the activity of the polypeptide of claim 1 in the presence of the test compound is Indicative of a 
compound that modulates the activity of the polypeptide of daim I. 



27. A method for 
polynucleotide, wherein said target polynucleotide 



in altering expression of a target 
a sequence of claim 5, the method 



a) exposing a sample comprising the tar get polynucleotide to a compound, under conditions 



b) detecting altered expression of (he target p 

c) comparing the expression of the target polynadeotide in the pres 
the compound and in the absence of the compound. 



23. A method for assessing toxicity of a test compound, said method comprising: 

a) treating a biological sample containing nuddc adds with the test compound; 

b) hybridizing the nucleic adds of the treated biological sample with a probe comprising at 
least 20 contiguous nudeotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
rrybridization complex is formed between said probe and a target polynucleotide In the biological 
sample, said target polynudeotide comprising a polynucleotide sequence of a polynucleotide of daim 
1 1 or fragment (hereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample Is Indicative of toxicity of the test 



WO 01/57085 



PCTA1S01/034S5 



<120> C-PROTEIN COUPLED RECEPTORS 
<130> PI-0032 PCT 



<210> 1 

<211> 470 

<212> PRT 

<213> Homo sapiens 

*220> 

<22l> misc_feeture ' 

<223> Incyto ID No: 7472033CD1 

<220> 

<221> unsure 
<222> 59, 128, 309 
<223> unknown or other 



Ser Gly Tyr Ala S«r Ser Sar Asn 
20 

Arg His Pro Leu Asp Tyr Leu Asp 

35 

Asn Thr Thr Ala lie Asn Thr Ser 

SO 

Arg Pro Leu Asp Pro Val Leu He 

65 

Ala Val Asp Ser Pro Trp Tyr His 
80 

Val Leu lie Val Pbe Gly Ala Leu 
95 

Ser Pro ser ser Cly Ser Pro Sar 

110 

Fhe He Leu Asn. Leu Ala He Zaa 

125 

HI a Pro Ser Oly Leu Ser Asp Leu 
140 

Pro Leu Thr Leu Met Olu He Leu 
155 

Ser eye Ser He Leu Cys Lys Thr 
170 

Cys He Phe Val Ser Thr He Ser 
185 

Arg Tyr Gin val He Val Tyr Pro 

200 

Val Gly Ala Val Thr He Leu Ala 
21S 

Leu Leu Ala Ser Pro Leu Phe Val 

230 

Asp Thr Pro Ala Leu Leu Gin Cln 



Leu Ala 

10 

Ser Val 

25 

L«U Gly 
40 

Asp Leu 

55 



85 

Gly Asn 

100 
cys Ala 

115 
Oly Cln 

130 
Leu Leu 

145 
Ser Ly* 

ICO 
He Ala 

175 
He Thr 

190 
Thr Arg 

Gly He 
220 

Tyr Lya 
235 

He Gly 



Asp Gly Clu 
Arg Tyr Leu 
Thr val His 
Asn Glu Thr 
Phe Leu Ser 
He Ser Met 
Thr Leu Gly 
Leu Leu Ala 
Ser Ly* Cys 
Cy« Leu val 
Tyr Trp Pro 
Met Leu Gin 
Ala He Ala 
Asp Ser Leu 
Trp Ala Leu 
Clu Leu He 
Leu Gin Asp 



Tyr Cly 
90 

Cys Tyr 

105 
He Trp 

120 
Clu Ser 

135 
Thr Met 

150 

Ala Leu 

180 
Pbe Asp 

195 
Gin Phe 

210 
Ala Leu 

225 
Asn Thr 

24 0 



106 



1/31 











245 










250 ^s^^^^r 


Pro 


Tyr 


Cys 


Ila 


Clu 
260 


Asp 


Trp 


Pro 


Sar 


Arg Asn cly Arg Phe^ryr 

265 270 


Tyr 


Ser 


Ila 


Phe 


Sar 


Leu 


Cys 


Vol 


ClD 


Tyr Leu Val Pro He Leu 








275 








200 285 


lie 


Val 


Ser 


Val 


Ala 


Tyr 


Pha 


Cly 


Ila 


Tyr Asn Lys Leu Lys Ser 
295 300 












Ala 


Val 


Cln 




Arg Lys Val clu Arg Cly 












Thr 




Cys 


Leu 


Lou Ila Sar He Ala He 




















XI* 


p. _ 


Cly 


Val 


S*T 




Leu 


Pro 


Leu 


Asp Pha Pha Asn Leu Tyr 


















Ala 


Asp 


Met 


Clu 


Arg 


Ser 


Pro 


Val 


Thr 


Cln Sar Met Leu Val Arg 


Tyr 






Cy* 






Ila 


Gly Mac 




















Pro 


Leu 


Leu 


Tyr 


cly 


Trp 


Lau 


Asn 


Asp 


Asn Pba Arg Lys Clu Ila 








3B0 








3B5 390 


ClD 


Glu 


Leu 


Leu 


cy* 


Arg 


cys 


Bar Asp 


Thr Asn Val Ala Leu Asn 










395 






400 405 


ciy 


His 


Thr 


Thr 


cly 


cy. 


Aan 


val 


Cln 


Ala Ala Ala Arg Arg Arg 










410 








415 420 


Arg 


Ly> 


Tyr 


Cly 


Arg 


Arg 


Ila 


LOU 


Cln 


Arg Arg Thr Cln Ala Ala 






425 










430 435 


Cly 


Ala 


cly 


Cly 


Ala 
440 


Arg 


Ala 


Val 


Pro 


Arg Arg Gly Arg Arg Sar 
445 450 


Oly 


Gly 


Hia 


Arg 


Leu 


His 


Asp 


Arg 


His 


His Clu Gly Gly Leu Ala 










455 






460 465 


Asn 


11* 


Val 


HI* 


Hi* 
470 













<aio> 2 

<211> 326 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> miac_feature 
<223> Incyte ID No: 



7472034CD1 
Asp 



Phe Arg Arg Ser He 3 
20 

Cly Xla Pro Cly Leu C 
35 

Leu Gly Ila Leu Tyr 1 

SO 

Leu Pha He He Trp 1 

SS 

Leu Pha Leu Sar Mat I 

ao 

Sar Thr Ala Pro Lys 
95 

Glu He Gly Tyr He 
110 

Ala Pha Ser Sar Met 
125 

Asp Arg Tyr Val Ala 
140 

Leu His Pro Cly Val 
155 

Arg Cly Leu Leu Leu 
170 

Leu Ha Phe Cys Cln 



Ila Ser 
Leu He 
Ser Cln 
Ala Leu 
Pro ser 
Ala He 
Ala Val 
Leu He 
Cly Val 
His Pro 
Arg Ho 



Ala Thr Ha He 



Clu Leu 
10 

25 
His Trp 

40 
Val Cly 

55 
Leu His 

70 



100 

Gin Met 
HS 

Leu val 
130 

Leu His 
145 

Oly Met 
160 

Pro He 
175 

Gly His 



Pro Glu 
Ala Pro 
He Ala 
Asn Val 
Gin ser 
Val Leu 
Val His 
Phe Phe 
Ala Met 
His Ser 
val val 
Leu Leu 
Ala Tyr 



Lys Clu 
15 

30 

Leu Pro 
45 

Thr He 
60 

Met Tyr 

75 

Ala Sar 
90 

Ala His 
105 
He His 

120 
Ala Leu 

135 
Thr He 

150 
Leu Val 

165 
Cly Thr 

100 
Cys Olu 



185 

His Met Ala Val Val 

200 

Arg Ala Tyr Gly Leu 
215 

V&l Leu Ala He Gly 

230 

Leu Lys Val Pro Gly 
24S 

Cys Cly Ser His He 
260 

Ha Pha Ser Phe Leu 

275 

His Val His Val Leu 
290 

Ala Leu Asn Pro Leu 

305 

Cln Arg Val Leu Arg 

<210> 3 

«211> 315 

<212> PRT 

<213> Homo sapiens 
«220> 

<221> mlsc_feature 

<223> Incyte ID 



190 

Lys Leu Ma Cys sar 

205 

Thr Met Ala Leu Leu 

220 

Val Ser Tyr Ala His 

235 

Sar Clu Ala Arg Leu 

250 

Cys Val He Leu Val 

265 

Thr His Arg Phe Cly 
280 

Leu Ala Thr Arg Tyr 
295 

val Tyr Cly Val Lys 
310 

val Phe Thr Cln Lys 

325 



MOs 747203 5CD1 



195 

Clu Thr Thr val Asn 
210 

Val He Gly Leu Asp 

225 

He Leu Cln Ala Val 
240 

Lys Ala Phe Ser Thr 

2S5 

Phe Tyr Val Pro Gly 
'270 

His His Val Pro His 
285 

Leu Leu Mat Pro Pro 

300 

Thr Cln Gin He Arg 



Cln Ser ser His Leu 
20 

He He Ala Leu Leu 

35 

Met Asp Ser Thr Arg 

50 

Leu Ala Ala Val Asp 

65 

Met Val Ser He Phe 
SO 

Ala Cys Phe Thr Gin 
95 

Clu Thr Oly Leu Leu 
110 

He Cys Lys Pro Leu 

125 

Met Leu Oly Mat Ser 
140 

He Thr Pro Leu Ser 
155 

sar Asn Val val Val 
170 

Arg Leu Ala Cys Ala 
185 

Ila Oly Ser Ser Lau 
200 

Ala Ser Tyr He Leu 
215 

Lys Thr Ala Oln Leu 
230 

Oly Val Met Ala Leu 
245 

Ala Ala Trp Leu Gly 
260 

Leu Leu Ala Asp Leu 



Trp Leu 
Gly Ash 
His Glu 
He Val 
Cys Ser 
Met Phe 
Leu Thr 
His Tyr 
Met Ala 
Trp Met 
His Ser 
Asp Pro 
Met Val 
Ho Leu 
Lys Ala 
Tyr Tyr 
Cln Asp 
Tyr Val 



Lau Leu Val 

10 

Ala He Ser 

25 

Thr He He 
40 

Pro Met Tyr 

Met Ala Ser 

70 

Gly Asp Ser 
85 

Pha Val His 

100 

Met Ala Phe 

115 

Lys Arg He 

130 

Ha Thr He 
145 

val Sar His 
160 

Tyr Cys Clu 
17S 

Val Pro ser 
190 

Cly Sar Asp 

20S 

Lys Ala val 
220 

Leu ser Thr 
235 

Lau Pro Cly 
250 

Val Val Pro 
265 

He He Pro 



Val Thr Ala 
Cys Pba Leu 
Ser Val Val 
Ser He Ser 
Leu Ala Thr 
Asp Arg Tyr 
Leu Thr Pro 
Arg Ala Ha 
Leu Pro Phe 
His Ila Ala 
Sar Leu Tyr 
Val Ala Phe 
Phe Cly Leu 
Cys Cly Ser 
Mat Ala Ser 
Leu His Thr 
Ala Thr Leu 



cly Lau 

IS 

XI. ^ 

cys val 
60 

Pro Lys 



90 

Ala Val 

105 
Val Ala 

120 
Gin Val 

135 
He Ala 

ISO 
Cys Oly 

165 
Lau Ala 

180 
Sar Leu 

195 
Ha Ala 

210 
Ser ser 

225 
His Val 

240 
He Tyr 

2S5 
Gin Val 

270 



275 280 285 

Ila He Tyr Gly Met Arg Thr Lys Cln Leu Arg Olu Arg He Trp 

290 295 300 

Sar Tyr Lau Mat His Val Lau Pha Asp His Ser Asn Leu Oly Sar 

305 310 315 



<210> 4 

<211» 314 

<212> PRT 

<213> Homo sapiens 

<220> 
' <221> misc_feature 
<223> Incyte ID No; 



7472036CD1 



<400> 4 








Hat 


Sar 


Ala 


Sar 


Asn 


1 








5 


Lau 


Val 


Gly 


He 


Pro 








20 


lie 


Pro 


Pha 


Cys 


Lau 








35 


Thr 


Lau 


Lau 


Lau 


Ila 
50 


Mat 


Tyr 


Lau 


Pha 


Leu 
65 


Ser 


Ser 


Ser 


Ala 


Lau 
80 


A*p 


Arg 


Clu 


Ho 


Aon 








95 


Lau 


His 


Ser 


Phe 


Sar 
110 


Ala 


Pha 


Asp 


Arg 


Tyr 
125 


Lys 


Val 


Lau 


Thr 


cly 








140 


Val 


Ala 


Arc 


Ala 


val 








1SS 


Arg 


Cys 


Pha 


His 


Tyr 
170 


cys 


Glu 


His 


Mat 


Ala 








185 


Pba 


Asn 


Asn 


He 


Tyr 
200 


Lau 


Asp 


Leu 


Lau 


Lau 

215 


Ala 


Val 


Lau 


Lau 


Lau 

230 


Oly 


Thr 


Cy* 


Val 


Sar 








245 


Thr 


val 


val 


He 


sar 

260 


Ala 


Pro 


Hi* 


Val 


His 

275 


Pro 


Pro 


Mat 


val 


Asn 

290 


Ila 


Arg 


olu 


ser 


Ha 

305 



Leu Olu 
Tyr Thr 
Cln Ala 
Mat Leu 
Lys Mat 
Pha Ala 
Met Olu 
Ala He 
Leu He 
Leu Met 
Arg Cly 
Val Arg 
He Ala 
He Leu 
ser cln 
He Cly 
: val Met 
» Leu Leu 
> He He 
i Gly Val 



Thr His Pro 
10 

His Leu His 

25 

Lau Ala Lau 
40 

Asp Ala Ala 

55 

Ala Ala He 

70 

Leu Ala He 
85 

Cys Leu Ala 

100 
ser Ala Val 

115 
Cys Lys Pro 

130 
Thr Lys Ha 

14S 
Thr Pro Leu 

160 
Pro Val Ha 

175 
Leu Ala Cys 

190 
Val Ala Met 

205 
Ser Tyr He 

220 
Olu Ala Arg 

235 
Ala He Leu 

250 
His Arg Val 

Ala Asn Phe 

280 
Tyr Cly Val 
295 



Thr Ala 
He Trp 
Leu Gly 
Leu His 
Asp Leu 
Phe Trp 
Gin Met 
Lau Leu 
Lau His 
Gly Met 



Gly Asp 
Phe He 
Phe Ila 
Tyr Lys 
Ala Phe 
Ala Arg 
Tyr Lau 
Lys Thr 
Lys Asp 



IS 

He Ser 

30 

Asn Cys 
45 

Olu Pro 
60 

Val Leu 

75 

Pha Arg 

90 

Phe Pha 

105 
Ala Mat 

120 
Tyr Thr 

135 
Ala Ala 

150 
Leu Leu 

165 
Cys Tyr 

180 
Thr Ser 

195 
Val Val 

210 
Leu Oln 

22S 
Ala Pha 

240 
Tyr Thr 

2SS 
His Ala 

270 
Lau Phe 

285 
Lys cln 

300 



<210» 5 

<2H> 321 

<212> PRT 

<213> Homo sapiens 
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<223> Incyte ID 
<400> 5 



PCT/US01/03455 



Not 7472037CD1 



Met 


Ala 


His 


Cln 


Ala 


Pro 


Glu 


Lys Gin Gin 


Asp Asn 


Cly Thr Trp 


1 








5 






10 




IS 


Leu 


val 


Thr 


Olu 


Pba 

20 


Lau 


Lau 


Val Oly Phe 

25 


Ser Asn 


Leu Pro Clu 
30 


Lau 


Arg 


Pro 


Thr 


Lau 


Pha 


Ha 


Leu Phe Lau 


Lau Thr 


Tyr Leu Val 








35 






40 




45 


Thr 


Lau 


ser 


Oly 


Asn 


Ala 


Thr 


He He Thr 


He He 


Cln Val Asp 








50 






55 




60 


Arg 


Thr 


Lou 


His 


Thr 
65 


Pro 


Mat 


Tyr Arg Phe 

70 


Leu Ala 


Val Lau Ser 

75 


Lau 


Sar 


Clu 


Thr 


cys Tyr 


Thr 


Lau Val Thr 


He Pro 


Asn Met Leu 










60 






85 




90 


Ala 


His 


Lau 


Lau 


Mat Glu 
95 


sar 


Gin Ala He 
100 


Ser He 


Ala Gly cys 


Arg 


Ala 


Oln 


Mat 


Pha 

110 


Pha 


Pha 


Leu Oly Leu 

HS 


Cly Cys 


Ser His cys 


Pha 




Lau 


Thr 


Leu 


Mat 


Gly 


Tyr Asp Arg 


Tyr Val 


Ala He Cys 










125 




130 




135 


His 


Pro 


Leu 


Arg 


Tyr 


Sar 


val 


He Mat Arg 


Pro Thr 


val cys Leu 








140 






14S 




150 


cy* 


Lau 


oly 


Ala 


Lau 

1S5 


Val 


Pha 


Cys Ser Cly 

160 


Phe Ser 


Val Ala Leu 
165 


Ha 


Clu 




cys 


Mat 

170 


He 


Pha 


Ser Ser Pro 
175 




Cly Ala Oly 
180 


His 


Val 


Clu 


His 


Pha 


Phe 


Cys 


Asp He Ala 


Pro Val 


Leu Lys Leu 










185 




190 




195 


Ser 


Cys 


Asp 


Clu 


Sar 

200 


Sar 


Leu 


Lys Cly Lau 
205 


Gly He 


Phe Phe Leu 
210 


Sar 


He 


Leu 


Val 


Val 
215 


Lau 


Val 


Sar Phe Leu 
220 


Pha Ha 


Leu Leu ser 

225 


Tyr 


Ala 


Phe 


Ha 


Val 


Ala 


Ala 


He Val Arg 


Ila Pro 


ser Ala Ser 








230 






235 




240 


Cly 


Arg 


Arg 


Lys 


Ala 
245 


Pha 


Sar 


Thr cys Ala 
250 


Ala His 


Lou Thr Val 
255 


Val 


Ila 


Val 


His 


Pha 


Cly 


cys 


Ala Ser He 


Ha Tyr 


Lau Arg Pro 










260 


265 


270 


Asp 


Sar 


Cly 


Ala 


Asn 

275 


Pro 


Sar 


Gin Asp Arg 
280 


Leu Val 


Ala Val Phe 

285 


Tyr 


Thr 


Val 


val 


Thr 


Pro 


Lau 


Leu Asn Pro 


val Val 


Tyr Thr Lau 








290 






295 




300 


Arg 


Asn 


Lys 


Olu 


Val 
305 


Arg 


Val 


Ala Leu Arg 


Lys Asn 


Leu Ala Arg 


Oly 


cys 


Cly 


Ala 


Phe Lys 


















320 













<210> 6 
<211> 331 
<212> PRT 
<213> Homo sapie: 

<220> 

<221> misc_feati 
<223> Incyte ID 



Noj 7472039CD1 



Met Ser Pro Asp 
1 


Gly 


Asn His Si 


tr Ser 


ASP 
10 


Pro Thr 


Clu 


Phe val 

15 


Leu Ala Gly Leu 


Pro 


Asn Leu At 


n Ser 


Ala 


Arg Val 


Clu 


Leu Phe 


20 






25 






30 


Ser val phe Leu 


Lau 
35 


Val Tyr n 


m Leu 


Asn 
40 


Leu Thr 


Oly 


Asn val 
45 


Leu He Val Cly 


val 


Val Arg Ala Asp 


Thr 


Arg Leu 


Cln 


Thr Pro 


50 






55 






60 


Met Tyr Phe Phe 


Leu 


Gly Asn L< 


iu Sar 


cy* 


Leu Clu 


He 


Lau Lau 
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65 


70 [ 












Val 


Ila 


Ila Pro Lys Mat 


Lau Sar Asn Pha Lau 




Val cln Val Ala 


Cvs 










80 


as 


- 90 








Hi a 


Thr 


Ila 


Ear Pba Ala Ala 


Cya Ila Thr Cln Pha 


Tyr Pha 


Thr Val Pro Ser 


Ala 










J5 


100 


105 










Phe 


Leu 


Cly Ala Sar Glu 


Pha Leu Lau Lau Ala 

115 


Val Met 


Pro Ala Arg Leu 


cln 










110 


120 








Asp 


Arg 


Tyr Leu Ala Ila 


Cys His Pro Lau Arg 


Tyr Pro 














125 


130 
















Cly Ala Val Cya 


Pha Axg Val Ala Lau 


Ala Cys 


cly pro Leu val 














145 


150 




Trp 


val 






Lao val Pro Val 


Lau Cly Pro Thr Val 


* 165 


Trp Aid Ala Ar o 


Arg 




















Lou 








Cly Ala Val Val cln 




Cly Pha Pha Val 






















Cya 






Civ Pro Lau Lau 


Arg Leu Ala Cya Thr 


Asn Thr 


Lau Thr Val Ala 


















Lys 


Lya 




Cl 


cl Thr ksd Phe 


Val Leu Ala Sar Leu 


Val Ila 


Arg Ala ciu Pro 


Leu 














275 


Val 


Ser 












Leu Asn Pro Mat 






















Leu 


Ala 








Ala Sar Cly Arg Cln 




Arg Sar Lau Pro 




















* 


Ser 


Th 


Cvs 


Hit Sor Kla Leu 


Ila Val val Thr Leu 


Pha Tvr 


Cly Cln Asp Olu 


a 


















Oly 


Ser 


Ala 


Ila 


Phe Lau Tyr Val 


Arg Pro Sar Cln Sar 


Cly Sar 


Leu Val Ser Clu 


Mot 








260 


265 


270 




335 


Val 


Asp 


Thr 


Asn 


Trp Ala Val Thr 


val Ila Thr Thr Pha 


Val Thr 












275 


280 


265 


<210> 8 




Pro 


Lou 


Leu 


Asn 


Pro Pha Ila Tyr 


Ala Leu Arg Asn Glu 


Cln Val 


<211> 1473 












290 


295 


300 


<212> PRT 




Ly« 


OlU 


Ala 


Leu 


Lys Asp Hat Pba 


Arg Lys val vol Ala 


Cly Val 


<213> Homo sapl< 












305 


310 


31S 




uu 


Cly 


Ann 


Leu 


Lau Lau Asp Lys 


Cys Leu Sar Olu Lys 


Ala Val 


<220> 












320 


325 


330 


<221> miac f«tur« 


Ly» 














<223> incyte ID 


No: 



PCT/US01/03455 



<210> 7 

<211> 337 

<212> PRT 

<213> Hono sapiens 

<220> 

<221> mi«c_f eature 
<223> Incyte ID No: 

<400> 7 

Met Cly Asn Asp sar 
1 5 
Leu sar Asp Arg Pro 
20 

Ila Asp Pro Leu Arg X 

3S 

Phe Leu Val Cly Val I 
50 

Oly Lys val Ala Arg 

Leu Ala Val Ala Asp 
80 

Ala Val Pro lie Ala 
95 

Oly Cys Arg Ala Leu 
110 

Ser Val Leu Leu Leu 

125 

Ala Leu Cly Pro Ala 



7472040CD1 



Val Ser 
val Asp 
Ala 
Pro cly 
Arg Arg 
Leu Leu 
Arg Cly 
Pro Ser 
Ala Ala 



Tyr clu Tyr Cly 
10 

Cys Leu Asp Cly 

Pro Leu Pro Lau 
40 

Asn Ala Met Val 

SB 

Val Cly Ala Thr 
70 

Cys Cys Leu Sar 

Cly His Trp Pro 
100 

lie Ila Leu Lau 

115 

Leu Sar Ala Asp 

130 

Sar Thr val cln 



Ala cys 
Tyr Ala 
Ala Trp 
Trp Lau 
Leu pro 
Tyr Cly 
Thr Mat 
Leu Cys 
Arg Ala 



Cly Ala 
Ila Tyr 
Cya Val 
Thr Ala 
val Ala 
Cys Arg 
Trp Ala 
Pro Asn 
Ila Val 
Pha Leu 
Ala Cys 
val Asp 
Clu Val 



Ala Trp 
Arg Arg 
val A»p 
Ila Axg 
sar Cys 
Pro Leu 
Pro Tyr 
Ser Ala 
Cly Leu 
Tyr Phe 
His Trp 
sar Lys 



14S 

Thr Leu Ala 
160 

Leu His Cln 

175 

Tyr Cly Cly 
190 

Phe Lau Phe 

205 

Hi a sar Ala 
220 

Cly Thr Ala 
235 

His Leu Leu 
350 

Leu Leu Ala 

265 

Ala Lou Ala 
280 

Cly Arg Ala 
29S 

Ala Leu Arg 

310 

Lys ser Thr 

325 



150 

Leu Lau Leu 

165 

Clu His Pha 
180 

Ser Sar Ser 
195 

Cly Phe Leu 
210 

Leu Lau Cys 
225 

lie val Val 
240 

Cly Leu Val 

255 

Arg Ala Leu 
270 

His Ser Cys 
285 

cln Lau Arg 
300 

Glu Sar Cln 
315 

Sar His Asp 

330 



4250893CD1 





<400> 8 




Mat Ala 




1 




Leu Lys 




Lys Ala 




Glu Lys 




Tyr Cly 


sar Asp 


^ Cln Mat 


Leu Ala 

30 


Cly His 


Ala lie 


Cly Sar 


45 


val Ala 

60 


He His 


Leu His 


Val Leu 


75 




He Leu 
90 


He Sar 


Ala Val 


His Glu 


105 




Tyr Ala 


Ala Val 


120 




Phe Leu 


Pro Arg 


135 


Cys Gly 


Thr Sar 



110 
i Pro 

125 
I Leu 

140 

His 

15S 
> ser 

170 
• Ser 

IBS 
i Clu 

200 



Trp Cly 
Leu Lys 
ser sar 
Met Clu 
Ala Trp 



Pro Thr 
Ala Gly 
Pro Asp 
Val Tyr 
cln' clu 
Trp Cly 
Ala Pro 
Tyr Thr 



Arg Leu 3 
Clu Phe i 
Sar Cly < 



Ala cys Tyr 
10 

Cln Lau Leu 

25 

Glu Thr Pro 
40 

Val Ala Ser Tyr Leu 
55 

Asp Leu Ala Leu Hia 
70 

cya Ala Cln Ala Gin 
Tyr Sar 



Cys Thr 
Thr Ser 
Gin Ala 
Ser Pro 
ser Pro 
Gly Thr 
Glu lie 



Pro ser clu 
100 
Ala Val Leu 
115 

Gin Cly Ser 
130 

Oly Arg Arg 
145 

Leu Pro Ser 
160 

Asn Ala Pro 
175 

Pro Cln Pro 
190 

Gin Trp Pro 
2 OS 

Arg Glu Axg 



Leu Clu Phe 

15 

Leu Ala Aan 
30 

Ala Cln Pro 
45 

Val Ala Gin 
60 ■ 

Thr Trp Glu 
75 

Clu Cly Ala 



120 

Olu Arg Arg 
13S 

Trp Arg Clu. 

150 

Ser Pro Asp 
16S 

Thr sar Thr 
180 

Ser Lau Ala 
195 

Leu Asp Glu 
210 

Clu Arg Glu 



WO 01/57085 

215 220 22S 

Lya Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala val Val Gly Thr 

230 235 240 

Pro Pro Cln Ala Hia Thr Ser Leu Oln Pro His His His Pro Trp 

245 250 255 

Clu Pro Ser Val Arg Clu Ser Lau cys Ser Thr Trp Pro Trp Lys 

260 265 270 

Asn Olu Asp Phe Asn Oln Lys Phe Thr Oln Leu Leu Leu Leu Gin 

275 200 28S 

Arg Pro His Pro Arg Sar Cln Asp Pro Leu Val Lys Arg Ser Trp 

290 295 300 

Pro Asp Tyr Val Clu Olu Asn Arg Oly His Leu He clu He Arg 

30S 310 315 

Asp Leu Pha cly Pro cly Leu Asp Thr Cln Clu Pro Arg He Val 

320 325 330 

He Leu Gin Gly Ala Ala Oly lie Cly Lys Ser Thr Leu Ala Arg 

335 340 345 

oln val Lys clu Ala Trp Cly Arg Cly Oln Leu Tyr cly Asp Arg 

350 355 360 

Phe Cln His Val Phe Tyr Phe Ser Cys Arg Glu Leu Ala Gin Ser 

365 370 375 

Lys Val Val Ser Leu Ala Clu Leu Ila Oly Lys Asp Cly Thr Ala 

380 385 390 

Thr Pro Ala Pro He Arg Gin He Leu Ser Arg Pro Olu Arg Leu 

395 400 405 

Leu Phe He Leu Asp Oly Val Asp Clu Pro Oly Trp Val Leu Cln 

410 41S 420 

Olu Pro Ser Ser Clu Leu Cys Leu His Trp Ser Gin Pro Gin Pro 

425 430 435 

Ala Asp Ala Leu Leu Cly Sex Leu Leu Oly Lys Thr He Leu Pro 

440 445 450 

Clu Ala Sar Phe Leu He Thr Ala Arg Thr Thr Ala Leu Cln Asn 

45S 460 465 

Leu He Pro Ser Lau Clu Gin Ala Arg Trp Val Olu Val Leu Cly 

470 475 480 

Phe Ser Clu Ser Ser Arg Lya Glu Tyr Pha Tyr Arg Tyr Phe Thr 

485 490 49S 

Asp Clu Arg oln Ala He Arg Ala Phe Arg Leu Val Lys Ser Asn 

500 505 510 

Lys Olu Leu Trp Ala Leu Cys Leu Val Pro Trp Val ser Trp Lou 

515 520 525 

Ala Cya Thr Cys Leu Mat Oln Gin Met Lys Arg Lys Glu Lys Leu 

530 535 540 

Thr Leu Thr Ser Lya Thr Thr Thr Thr Leu cya Leu Hia Tyr Leu 

545 550 555 

Ala Oln Ala Leu Cln Ala Oln Pro Leu Cly Pro Oln Leu Arg Asp 

560 565 570 

Leu Cys Ser Leu Ala Ala Olu Oly He Trp Cln Lys Lys Thr Leu 

575 580 585 

Phe Sar Pro Asp Asp Lau Arg Lys His Cly Leu Asp Oly Ala He 

590 595 60D 

He Ser Thr Phe Leu Lys Met Oly He Leu Cln Clu His Pro He 

605 630 615 

Pro Leu Sar Tyr Ser Phe Ila His Leu Cys Phe Gin Clu Phe Phe 

620 625 630 

Ala Ala Mat sar Tyr Val Leu Olu Asp Clu Lys cly Arg Oly Lys 

635 640 645 

His Ser Asn Cya He He Asp Leu Olu Lys Thr Leu Glu Ala Tyr 

650 655 660 

cly Ha His cly Leu Pha Cly Ala Ser Thr Thr Arg Phe Leu Leu 

665 670 675 

Oly Lou Leu Ser Asp Clu Oly Olu Arg Olu Met Glu Aan Ha Pha 

680 685 690 

His Cys Arg Leu Sar Oln oly Arg Asn Leu Met Gin Trp Val Pro 

695 700 705 

Ser Leu Cln Leu Leu Lau Cln Pro His Ser Leu Glu Ser Leu His 

710 715 720 
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cys Leu Tyr Olu Thr Arg Asn Lys Thr Phe Leu Thr Cln Val Met 

' 725 730 735 

Ala His Phe Clu Glu Met Oly Met Cys Val Clu Thr Asp Met Glu 

740 745 750 

Leu Leu Val Cys Thr Phe Cys He Lys Phe Ser Arg His Val Lys 

7SS 760 765 

Lya Leu Gin Leu He Clu Cly Arg Cln His Arg Ser Thr Trp Ser 

. 770 775 780 

Pro Thr Met Val Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala 

785 790 795 

Tyr Trp Cln He Leu Phe Ser Val Leu Lya Val Thr Arg Asn Leu 

800 805 810 

Lys Clu Leu Asp Leu Ser Cly Asn Sar Leu Sar His Sar Ala Val 

B15 820 825 

Lys ser Leu Cys Lys Thr Leu Arg Arg Pro Arg Cys Leu Leu Olu 

B30 835 840 

Thr Leu Arg Leu Ala Oly Cys Cly Leu Thr Ala Clu Asp Cys Lys 

845 850 855 

Asp Leu Ala Phe Gly Leu Arg Ala Asn Gin Thr Leu Thr Clu Leu 

B60 S6S 870 

Asp Leu Ser Phe Asn Val Leu Thr Asp Ala Cly Ala Lys His Leu 

875 880 885 

Cys Oln Arg Leu Arg Cln Pro Ser Cya Lys Lau Oln Arg Leu Cln 

B90 895 900 

Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Oln Asp Leu Ala 

905 910 915 

Ser Val Leu Ser Ala Sar Pro Ser Leu Lya Clu Leu Asp Leu Cln 

920 925 • 930 

Gin Asn Asn Leu Asp Asp Val Cly val Arg Leu Leu cys Clu Cly 

935 940 945 

Leu Arg His Pro Ala Cya Lys Leu He Arg Leu Cly Leu Asp Cln 

950 955 960 

Thr Thr Leu Ser Asp Clu Met Arg Cln Clu Leu Arg Ala Leu Clu 

965 970 975 

Cln Clu Lys Pro Cln Leu Leu Ha Phe Ser Arg Arg Lys Pro Ser 

980 985 990 

val Met Thr Pro Thr Clu Cly Leu Asp Thr Oly Olu Met Ser Asn 

995 1000 1005 

ser Thr Ser Ser Leu Lys Arg Cln Arg Leu Gly Ser Glu Arg Ala 

1010 1015 1020 



Lys He Phe Pro He Ala Clu Ila Ala Clu Glu Ser Ser Pro Olu 

1040 . 1045 1050 

Val Val Pro Val Olu Leu Leu Cys Val Pro Sar Pro Ala Ser Oln 

10S5 1060 1065 

Oly Asp Leu Hia Thr Lys Pro Leu Cly Thr Asp Asp Asp Phe Trp 

1070 1075 1080 

Cly Pro Thr Gly Pro Val Ala Thr Clu Val Val Asp Lys Olu Lys 

1085 1090 1095 

Asn Leu Tyr Arg Val Hia Phe Pro Val Ala Oly Ser Tyr Arg Trp 

1100 1105 1110 

Pro Asn Thr Oly Leu Cys Phe Val Met Arg Clu Ala Val Thr Val 

HIS 1120 1125 

Olu He Olu Phe Cys Val Trp Asp Cln Phe Leu Cly Olu He Asn 

1130 1135 1140 

Pro Oln Hia Ser Trp Met Val Ala Cly Pro Leu Leu Asp He Lys 

1145 1150 1155 

Ala Clu Pro Cly Ala Val Clu Ala Val His Leu Pro His Phe Val 

1160 1165 1170 

Ala Leu Gin Gly Cly His Val Asp Thr Ser Leu Pha Cln Mat Ala 

1175 1180 1185 

His Phe Lys Clu Clu Oly Met Leu Leu Olu Lys Pro Ala Arg Val 

1190 1195 1200 

Glu Leu His His He val Leu clu Asn Pro Sar Phe sar Pro Leu 

1205 1210 121S 

Cly Val Leu Leu Lys Met He His Asn Ala Leu Arg Phe He Pro 
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1220 




1225 






vol 


Thr 


8mt 


Val val 


Lou Lou Tyr 


Hi* Arg 


Val Hi* Pro 


cW**R 








12)5 


1240 




124 5 


val 


Thr 


Ph* 


HI* Ltu 


Tyr Lou 11* 


Pro Sor 


Asp Cy* Sor 


Ilo Arg 








1250 


1255 




1260 


Ly. 


Ala 


11* 


A*p Asp 


Lou Clu Mot 


Ly* Pba 


Cln Pbo Val 


Arg Ilo 






1265 




1270 




1275 


Hi* 


Ly. 


Pro 


Pro Pro 


Lou Thr Pro 


Lou Tyr 


net cly cy* 


Arg Tyr 






1280 




1285 




1290 


Thr 


v.l 


s*r 


Gly Sor 


Cly Sor Cly 


Hot Lou 


clu 11* Lou 


Pro Ly* 








1295 


1300 




1305 


Olu 


Ltu 


Olu 


Lou Cy* 


Tyr Arg S«r 


Pro Cly 


Glu Asp cln 


Lou Ph* 








1310 


131S 




1320 


Sar 


Olu 


Ser 


Tyr Val 


Cly His Lou 


Cly Sor 


Cly Ilo Arg 


Lou Cln 








1325 


1330 




1335 


Val 


Ly. 


A«p 


Ly* Ly* 


Asp Clu Tor 


LOU Val 


Trp Glu Ala 


Lou Val 






1340 


1345 


1350 


Ly. 


Pro 


Cly 


Asp Lm 


Hot Pro Ala 


Thr Thr 


L«u Ilo Pro 


Pro Ala 




1355 




1360 




136S 


Arg 


Il« 


Ala 


Val Pro 


Sor Pro Lou 


Asp Ala 


pro cln Lou 


Lou His 








1370 




137S 




1380 


Ph* 


Val 


Asp 


Cln Tyr 


Arc Clu Cln 


Lou 11* 


Ala Arg Val 


Thr Sor 








1385 




1390 




1395 


Val 


Olu 


Val 


val Leu 


Asp Ly* Lou 


Hi* cly 


Gin Val Lou 


sor Cln 








1400 


1405 




1410 


Olu 


Gin 


Tyr 


Glu Arg 


Val Lou Ala 


Glu A*n 


Thr Arg Pro 


Sor Cln 






1415 




1420 




1425 


Met 


Ara 


Ly« 


Lou Ph* 


sor Lou Sor 


Cln Sor 


Trp Asp Arg 


Ly* cya 






1430 




1435 


1440 


Ly. 


Asp 


Cly 


Lou Tyr 


Cln Ala Lou 


Ly* clu 


Thr His Pro 


His Lou 


1445 




1450 




1455 


He 


Met 


Clu 


Lou Trp 


clu Ly* cly 


sor Ly* 


Ly* Oly Lou 


Lou Pro 








1460 


1465 


1470 


L«u 


SOT 


sor 
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<210» 9 

<2I1> 328 

<212> PRT 

<313> Homo sapiens 

<220> 

<22i> i*isc_f oaturo 
<223> Incyto ID No: i 



Mat 


Ly* 


Lou Trp 


Hot 


Olu Sor His 


Lou Ilo 


Val Pro Glu 


Thr Arg 


1 


5 




30 




15 


Pro 


Sor 


Pro Arg 


Hot 


Hot Sor Asn 


Gin Thr 


Lou Val Thr 


olu Pbo 






20 




25 




30 


Ilo 


Leu 


Cln Cly 


Pbo 


ser Olu His 


Pro Olu 


Tyr Arg val 


Phe Leu 






35 




40 


45 


Pbo 


Sor 


Cya Pbo 


Lou 

SO 


Ph* Leu Tyr 


Sor Cly 

55 


Ala Lou Thr 


Oly Asn 
60 


Val 


Leu 


Ilo Thr 


Leu 


Ala Ilo Thr 


Pbo Asn 


Pro Cly Lou 


His Ala 








65 




70 


75 


Pro 


Mot 


Tyr Pbo 


PhO 


Lou Leu Aan 


Leu Ala 


Thr Met Asp 


He Xle 






80 




85 




90 


cy* 


Thr 


sor sor 


Ilo 


Mot Pro Ly* 


Ala Lou 


Ala sor Lou 


Val Sar 






95 


. 100 




105 


Olu 


Glu 


Sar Sor 


lie 


Sor Tyr Gly 


Gly Cy* 


Mat Ala Cln 


Lou Tyr 








110 


115 




12 D 


Ph* 


Lou 


Thr Trp 


Ala 


Ala Sor Ser 


Glu LOU 


Lou Lou Lou 


Thr Val 






125 




130 




135 


HOC 


Ala 


Tyr Asp 


Ar 0 
140 


Tyr Ala Ala 


Ilo Cy* 
145 


His Pro Leu 


His Tyr 
ISO 


Ser 




Mat HOC 


Ser 


Ly* val Pbo 


cys sor 


cly Leu Ala 


Thr Ala 
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155 

Val Trp Lou Lou Cy* 
170 

Mot Leu Arg Lou Asp 

185 

Pbo cys clu Val Pro 
200 

Tyr val Asa Gly vol 
21S 

Ilo val Asn Pbo Leu 
230 

Sor ser Ilo Lou Lys 
245 

Phe Ser Thr Cys Ser 

260 

Thr Ala Val Pbo Tyr 
275 

Ala cly Ly* Sor Ly* 
290 

Pro Thr Lou Asn Pro 
305 

Lys Ala Ala Leu Arg 

320 

<210> 10 
<211> 384 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mi*c_f eature 
<223> Incyto ID Hot 
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Ala Val 
Pbo Cys 
Pro Leu 
Met He 
Mot Thr 
val Ly* 
Sor His 
Ala Tyr 
Leu Ala 
Leu He 
Lys Laru 



Asn Thr 
Gly Pro 
Lou Leu 
Val Leu 
Ilo Ala 
Thr Ala 
Leu Thr 
XI* sor 
Gly Leu 
Tyr Thr 
Pbo Pro 



160 165 
Ala Ilo Hi* Thr Gly Leu 
175 180 
Asn Val Ilo Ilo Hi* Phe 
190 195 
Lou Ser Cys Ser Ser Thr 
205 210 
Ala Asp Ala Ph* Tyr Gly 
220 225 
Ser Tyr Gly Pbo Ho Val 
235 240 
Trp Cly Arg Cln Lys Ala 
250 255 
Val Val Cys Mot Tyr Tyr 
265 270 
Pro Val Ser Cly Tyr Sor 
280 285 
Lou Tyr Thr Val Leu Ser 
295 300 
Leu Arg Asn Lys Glu Val 
310 315 
Phe Phe Arg Asn 
325 
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Hot 


Asn 


Val 


Leu 


Leu 


Ala 


Asp Ser 


Asn Sor 


Aan 


Ly. 


Ly. 


He 


val 


1 














10 






15 


Hi* 


Ly* 


His 


He 


To 


sor 


Leu Gin 


sor Ala 


Pro 


Ly. 


Thr 


Thr 


Asn 
















25 








30 


Leu 


Gin 


Pro 


Sor 


11* 


Ser 


Asp He 


Leu Leu 


Ser 


Val 


Glu 


Sar 


Asn 










35 




40 










45 


A»p 


Arg 


Lys 


Asn 


Val 


Sor 


Lys He 


Ly* Oly 


Asp 


cy* 


Phe 


Asn 


Thr 










50 




55 






60 


ATB 


Val 


Ser 


cy* 


Asp 

65 


Ser 


Lys He 


Thr Sex 
70 


Hot 


Glu 


Asn 


Asn 


Thr 

75 


GlU 


val 


Ser 


clu 


Phe 

80 


He 


Leu Leu 


Oly Lou 

as 


Thr 


Asn 


Ala 


Pro 


Olu 
90 


L*U 


Gin 


val 


Pro 


Leu 


Phe 


He Met 


Pbo Thr 


Lou 


Zlo 


Tyr 


Leu 


Ho 










95 




a{y Met 


100 








105 


Thr 


Leu 


Thr 


Gly 


Asn 


Lou 


He He 


Leu 


Ilo 


Lou 


Leu 


Asp 








110 




11S 










120 


S*r 


Hi* 


Leu 


Hi* 


Thr 


Pro 


Mot Tyr 


Pbe Phe 


Leu 


Ser 


Asn 


Lou 


ser 










125 




130 










135 


L*u 


Ala 


Cly 


He 


Gly 


Tyr 


Sor Ser 


Ala Val 


Thr 


Pro 


Ly. 


Val 


Lou 










140 




145 








ISO 


Thr 


Oly 


Leu 


Leu 


He 

155 


Olu 


Asp Lys 


Ala He 

160 


Ser 


Tyr 


Sor 


Ala 




Ala 


Ala 


Oln 


Met 


Phe 


Phe 


Cys Ala 


Val Pbe 


Ala 


Thr 


Val 


Glu 


Asn 










170 




175 










18D 


Tyr 


Leu 


Leu 


Sor 


Ser 
185 


Met 


Ala Tyr 


Asp Arg 
190 


Tyr 


Ala 


Ala 


Val 


Cys 

195 


Asn 


Pro 


Leu 


His 


Tyr 


Thr 


Thr Thr 


Met Thr 


Thr 


Arg 


Val 


cys 


Ala 










200 






205 






210 


cys 


Leu 


Ala 


11* 


Gly 
215 


cys 


Tyr val 


Ha Gly 
220 


Pbe 


Leu 


Aon 


Ala 


Ser 
225 


11* 


Gin 


He 


Cly 


Asp 


Thr 


Pbo Arg 


Leu Ser 


Phe 


Cys 


Mot 


ser 


Asn 








230 




235 








240 


V*l 


He 


Hi* 


Hi* 


Phe 


Pbe 


cy* Asp 


Lys Pro 


Ala 


Val 


lie 


Thr 


Leu 
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245 

Thr Cya Sar Glu Lys 
260 

ser Ser phe Asn val 

275 

Tyr Leu Pbe He Lou 

290 

Oly Tyr Gin Ly* Pro 
305 

II* Pbe Leu Phe Tyr 

320 

Ser Ser Ser Hi* Ser 

33S 

Tyr Thr M*t He Ho 

350 

Arg Asn Ly* Asp val 
365 

Ala Lys Tyr ser Leu 
380 

<210> 11 

<211> 419 

<212> PRT • 

<213> Hobo sapiens 

<220> 

<221> Bi*c_foature 
<223> Incyto ID NOs 



250 255 
Hi* He Ser Glu Leu He Leu Val Leu He- 

265 270 

Pbe Pbo Ala Leu Leu vol Thr Lou Ho Ser 

280 285 

He Thr Ilo Lou Ly* Arg Hi* Thr Gly Lys 

295 300 

L*u Sor Thr Cya Gly Ser His Lou Ho Ala 

310 315 

He Thr Val lie Ilo Met Tyr He Arg Pro 

32S 330 

Hot Asp Thr Asp Lys II* Ala ser Val Phe 

340 34S 

Pro Hot Leu Ser Pro Ho Val Tyr Thr Leu 

355 360 

Lys Asn Ala Pbo Hot Ly* Val Val Olu Lys 
375 
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370 



Mot 


Leu 


Ala 


Ala 


Ala Phe 


Ala 


Asp Ser 


Asn ser ser ser Met Asn 


1 








5 






10 15 


Vol 


Ser 


Ph* 


Ala 


Hi* Leu 

20 


His 


Pbe Ala 


Cly Gly Tyr Lou Pro Sor 
25 30 


Asp 


Ser 


Oln 


Asp 


Trp Arg 
35 


Thr 


He He 


Pro Ala Leu Leu Val Ala 
40 45 


Val 


Cys 


Leu 


Val 


Oly Phe 
50 


Val 


Cly Asn 


Leu cys val He Gly He 
55 60 


Lou 


Leu 


Hi* 


Asn 


Ala Trp 

65 - 


Lys 


Cly Lys 


Pro Ser Met He Hi* Ser 

70 7S 


Leu 


He 


Leu 


Asn 


Leu Sar 


Leu 


Ala Asp 


Leu Ser Leu Leu Leu Phe 










80 




85 90 


Sar 


Ala 


Pro 


11* 


Arg Ala 
95 


Thr 


Ala Tyr 


Ser Lys Ser Val Trp Asp 

100 105 


Leu 


Oly 


Trp 


Pho 


Val Cy* 
110 


Lys 


Ser Ser 


Asp Trp Pba Ha His Thr 

115 120 


Cys 


Hot 


Ala 


Ala 


£«. 


Leu 


Thr Ho 


Val Val val Ala Lys Val 
130 135 


Cys 


Pbo 


Met 


Tyr 


Ala S*r 


Asp 


Pro Ala 


Lys Gin Val Ser He Hi* 








140 






145 150 


Asn 


Tyr 


Thr 


no 


Trp Ser 
155 


Vol 


Leu Val 


Ala il* Trp Thr Val Ala 

160 165 


Ser 


Leu 


Leu 


Pro 


Leu Pro 
170 


Olu 


Trp Pbo 


Ph* Ser Thr Ho Arg His 

175 180 


His 


Olu 


Gly 


Val 


Clu Met 


cy* 


Leu Val 


Asp Val Pro Ala Vol Ala 


Glu 








185 




190 195 


olu 


Pho 




Sar Met 


Phe 


Oly Lys 


Leu Tyr Pro Leu Leu Ala 










200 




305 210 


Phe 


Gly 


Leu 


Pro 


Leu Pbe 

215 


Pho 


Ala sor 


Pbe Tyr Pbe Trp Arg Ala 

220 225 


Tyr 


Asp 


Cln 


cy* 


Lys Ly* 

230 


Arg 


Cly Thr 


Ly* Thr Gin Asn Leu Arg 
235 240 


Asn 


Cln 


Ilo 


Arg 


Ser Lys 
245 


Cln 


Val Thr 


Val Met Leu Lau Ser He 

2S0 255 


Ala 


Ho 


Ilo 


Ser 


Ala Leu 


Leu 


Trp Leu 


pro Clu Trp vol Ala Trp 










260 




265 270 


Leu 


Trp 


Val 


Trp 


Hi* Leu 


Lys 


Ala Ala 


Cly Pro Ala Pro Pro Cln 
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27 S 








280 






285 


Gly 


Pbe He Ala 


Leu 


s* r 


Cln Val 


Leu 


Met 


Ph* 


Ser He Ser 


ser 




290 








295 






300 


Ala 


Asn Pro Leu 


11* 


Phe 


Leu Vol 


Met 


Ser 


Clu 


Glu Phe Arg 


Clu 






305 








310 




315 


Oly 


Leu Lys Cly 


Val 
320 


Trp 


Lys Trp 


Met 


Ilo 

325 




Lys Lys Pro 


Pro 
330 


Thr 


Val Ser Glu 


Ser 

335 


Gin 


Olu Thr 


Pro 


Ala 
340 


Gly 


Asn Ser Olu 


Cly 
345 


Lou 


Pro Asp Lys 


Val 


Pro 


Ser Pro 


Clu 


Ser 


Pro 


Ala Ser He 


Pro 




350 








355 






360 


Clu 


Lys Clu Lys 


Pro 


Ser 


S*r Pro 


Ser 


Ser 


Cly 


Lys Cly Ly* 


Thr 




365 








370 


375 


Glu 


Lys Ala Clu 


He 

380 


Pro 


Ho Leu 


Pro 


Asp 

385 


Val 


Clu Cln Pho 


Trp 
390 


Hi* 


Glu Arg Asp 


Thr 


Val 


Pro Ser 


Val 


Gin 


Asp 


Asn Asp Pro 


He 




395 








400 






405 


Pro 


Trp Clu Hi* 


Clu 


Asp 


oin ciu 


Thr 


Gly 


Olu 


cly Vol Lys 






410 






415 







<210> 12 

<211> 314 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi*c_f*aCure 
<223> Incyt* ID No. 



Cly Leu Pbe Ser Hi* Ser 
20 

Val Ala Val Met Phe 

35 

Leu Pbe Leu He Arg 

SO 

Phe Leu Lou Ser Cln 
65 

Val Thr He Pro Lys 
80 

Ala Thr Ser Tyr Cly 
95 

Leu Hat Gly Val Ala 
110 

Asp Arg Tyr Val Ala 
125 

Met Arg Arg Oln Val 
140 

Val Gly Val Leu Asn 

155 

Phe Pro Tyr Cy* Ala 

170 

Val Pro Ala Leu Leu 
185 

Olu Hot Ala Leu Ser 

200 

Leu Ser Leu He Ala 

215 

Leu Ser Met Arg Ser 

230 

Cys Ser Ser His He 
24S 

Val Phe Hot Tyr Met 
260 

Asp Asn Val Val Ser 



Oln Ser 
Oly 
Val Ilo 
VOl Asp 
Leu Ser 
Met Ala 
cly Oly 
Glu oly 
val Cys 
Cys Leu 
Ala ser 
Ser Arg 
Lys Leu 
Thr Ser 
Thr Ser 
clu Glu 
Thr val 
val Pro 



Val Ala 
Sar Arg 
Cly Leu 
ser Arg 
Leu Phe 
Ser Asp 
Ala Ala 
val Leu 
Oln Pro 
Leu Met 
He Gin 
He Val 
Ser Cys 
Cly Val 
Tyr Cly 
Ala Arg 
Val Cly 
Cys Ala 
Tyr Ser 



Ser Asp Phe 

10 

Cln Leu Leu 
25 

Leu Gly Asn 
40 

Leu Hi* Thr 

55 

Asp He Gly 
70 

Pbe Leu Arg 
85 

Cln He Phe 
100 

Leu Val Leu 
115 

Leu Cln Tyr 
130 

Hot Gly Ser 

145 

Thr Ser He 
160 

Asp His Pbe 

Ala Asp Thr 
190 

Leu He Leu 

205 

Hi* Val Leu 
220 

Hi* Ly* Ala 

235 

Leu Pbe Tyr 
250 

Tyr Hi* Ser 
265 

Leu Val Thr 



He Lou vol 

IS 

Phe Ser Leu 

30 

Thr Val Leu 
45 

Pro Met Tyr 
60 

Cy* Pro Met 

75 

Oly Glu Gly 
90 

Phe Leu Thr 

105 

Hot Ser Tyr 
120 

Pro val Leu 

135 

Ser Trp Val 

ISO 

Thr Leu Hi* 

165 

Pbe Cy* Olu 
180 

Cy* Ala Tyr 
195 

Met Leu Pro 

210 

Oln Ala Val 

225 

val Thr Thr 
240 

Oly Ala Ala 

255 

Pro Cln Cln 
270 

Pro Thr Leu 
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27S 

Asn Pro Lou He Tyr 
290 

Leu Val Lya Vol Leu 

305 

<210> 13 

<211> 254 

<212> PRT 

<213> Haao sapiens 

<220> 

<221> raimc_featuro 
<223> Incyte ID tie: 



Val Val Oly Val Leu 
20 

Ser Phe Tyr Cy» Thr 

35 

Val Lye His Leu He 

50 

Cly Val Pro Gin Thr 
65 

Asn Ala Leu Gly Cys 
80 

Arg Oly Val Ser lie 
95 

Val He Thr Vol Ser 
110 

Glu Lya Ala Pro Lye 
125 

He Val eye Met Leu 
140 

Gly Lya Trp Aen Tyr 

Tyr Cya Ser Gly Gly 
170 

Ala Met Leu Leu Ser 

IBS 

Leu Trp Val Ser Ser 
200 

Gin Arg Val Gin His 
21S 

ser Pro Glu Asn Arg 
230 

Thr Phe Val Ser Ser 
245 

<210> 14 

<2H> 362 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mlsc_feature 
<223> incyte ID »Oi 



Cys Asp Ala Glu Cly 
20 

He Lou Gly He Val 



Ser Leu Arg Asn 
Ser Arg Ale Cly 



i ir^jajaa 
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Val Ala 
Gly Ser 
Cly Cya 
Val Ala 
Met Ale 
Lya Leu 
Gly Thr 
per Arg 
His Val 
Val Asn 
Thr Aen 
Cly Aen 
Phe Pro 
ser Met 
He Asp 



Val Gly 
Phe Ser 
Arg Leu 
Asn Pbe 
Ala Pbe 
Vel Pbe 
Thr Cye 
Lye Ser 
Cly Phe 
He He 
He Thr 
Asn Lya 
Asp Val 
Val Cya 
Arg Ser 



380 

Pro Glu Val ' 
295 300 
Leu Arg Gin Met Cys 
310 



Leu 

100 
Arg 
115 
Ser 
130 
Phe 
145 
val 
160 
He 
175 
Leu 
190 
He 
205 
Asp 
220 
He 
235 







Gin 










is 


















Ser Thr 


Asp 


Leu 


He 








45 


Ala Leu 


Arg 


cys 


Lys 
60 


Val Arg 


Tyr 


Phe 


Leu 

75 


Leu His 


Arg 


Val 


°ll 


Leu Ser 


val 


Phe 


Gin 








105 


Trp Ala 


Lys 


Leu 


Lys 








120 


Val Leu 


Leu 


cys 


Trp 








135 


Pro Met 


Tyr 


Val 


Thr 






150 


Asn Glu 


Asp 


Leu 


Oly 






165 


Ala Gin 


Thr 


Leu 


Arg 








1B0 


Cys Leu 


Cly 


Lou 


Mot 








195 


Leu His 


Arg 


His 


Lys 








210 


Leu Ser 


Pro 


Arg 


Ala 






225 


Leu He 


Leu 


Val 


S«r 








240 


Leu Phe 


Gin 


Val 





7472077CD1 

He Glu Ser Thr 
Pro Trp Gly He 
Val Thr He Leu 



WO 01/57085 
Phe Leu Met Arg I 
Pro Thr Gin Leu t 
Leu Ala Phe Ala 
Val Arg Tyr Phe 
Cys Leu Leu Ala 
Cys Val Ser Phe 
Cys Ser Leu Leu 
He Met Thr Arg 
Leu Asn Val Asp 
Bet Ala Leu Thr 
Cya Glu Asn Trp 



Leu Phe Ser He 
Arg Gly Asn Pro 
Val Val Cys He 
Leu Tyr He Val 
Gin Glu Cys Pre 
Gin His Ser Phe 
Trp Lye Val Leu 
Ala Arg Asp Ser 
Tyr Gly Thr Pro 
Cys Phe He Pro 
Cly Val' 



Pha He 

80 
Leu Pbe 

95 
His Ala 

110 

ser Trp 

125 

Gin He 
140 

Cly Met 
155 

Pha Val 

170 

Pha Phe 
105 

Lya Gin 
200 
Ha He 

215 

Gin Phe 
230 

Ala Leu 
245 

Pro Glu 
260 

Leu Gin 
275 

Gin Val 
290 

Leu Asn 

305 

Asp Cly 
320 

He Cln 

335 

Gin Ala 
350 



ASP 

Leu Leu 
He Glu 
Gly Val 
Ser Asn 
Thr Thr 
He He 
Met Phe 
Val Leu 
val Ser 
His Cly 
Trp Val 
Gin Arg 
Val Thr 
Leu cys 
Cly Asn 
Glu Asn 
Ser Asp 
Ala Glu 
Pro Cln 
Lys Leu 



Ser Cln Trp 
SS 

Val Leu Gly 
70 

Asn cln Gin 
85 

Pha Ala Lou 

100 

Val Lys Leu 
IIS 

Leu Cys He 

130 

Thr Glu Tyr 
145 

Asn Met Thr 
160 

Val Tyr Val 
175 

Ala Thr Pbe 
190 

Leu He Phe 

205 

Trp He Ser 
220 

Pro Cln Trp 
235 

Ala Trp Val 
250 

Leu Tyr Arg 
265 

Cys Pro Val 
280 

Glu Lau Ser 
295 

Leu Ser His 

310 

Asp Val Ala 

325 

Val Asp Pro 
340 

Pro Gin Gin 
355 



Asn val Leu 
60 

Leu Phe Cly 
75 

Thr Ale Pro 
90 

cys Pbe Ser 

105 

Val Arg Gly 
120 

Ala He Gly 

13S 

Val Thr Leu 
150 

Pro cys Cln 
165 

Leu Phe Leu 

180 

Cys Gly Pro 
195 

He Thr Val 
210 

Met Leu Leu 

225 

Asp Asp Pro 
240 

Pbe Leu Leu 

255 

Sar Cy» Arg 

270 

Thr Ala Tyr 

Arg Asp Lys 
300 

Ser Gly Ala 
315 

Leu Thr Ser 
330 

Thr Gin Glu 
345 

Asp Ala Cly 
360 



<210> IS 

<211> 370 

<212> PRT 

<213>- Homo sapiens 

<220> 

<221> mioc_f aature 

<223> Incyte ID No: 7472082CD1 



Gly Asp Tyr Phe Leu Leu 
10 IS 
He Leu Clu Ser Leu Ala 



Met Cys Lys Cys 


Pbe Arg Ser 


Cly 


Asn 


Ser Thr 


Pro 


Val Leu 


Cys 


1 


5 






10 






15 


His Arg Asn Ser 


Glu Ala Trp 


Gin 


Pro 


Arg Lys 


Ale Pro Arg 


Thr 




20 






25 






30 


Gin Gin Thr Asp 


Met Gly Tyr 


Thr 


Asn 


Leu Asn 


Ser 


Lys Lys 


Glu 




35 






40 




45 


Cys Met Tyr lie 


Lys Clu Asn 


Phe Lys 


Lys Thr 


Val 


Asp Lys 


He 




50 






55 




60 


Val Asp Pro Gly 


Asn His Ser 


Sar 


Val Thr Glu 


Ser 


He Leu 


Ala 




65 






70 






7S 


Cly Leu Ser Glu 


Gin Pro Glu 


Leu 


Gin 


Leu Arg 


Leu 


Pbe Leu 


Lau 




80 






85 






90 



WO 01/57085 
Pbe Leu Cly He 
He Thr Leu He 
Tyr Pbe Leu Ser 
Val He Thr Pro 
He He Ser Tyr 
He Phe Ala He 
Asp Cya Tyr Val 
Met Ser Tyr His 
Leu Cly He Lou 
Leu Phe Leu Cys 
Leu Phe Pro Leu 
Glu Leu Leu Val 
Ala Leu Thr He 
Leu Arg lie His 
Cys Ser Ser His 
Ala Pbe Met Tyr 
Arg Lys Val Ser 
Asn Pro Leu He 
Vel Lye Lys He 

<210> 16 

<211> 319 

<212> PRT 

<213> Hoao sepiei 

<220> 

021> »isc_featU! 
<223> Incyte ID 

<400> 16 
Met Thr Pro Gly 

Vel Thr Lys Pbe 

Gin Glu Leu Leu 

Vel Val cly Asn 

Hie Leu oin Ser 

Leu Asp He Cys 

Asn Phe Leu Val 

Val Cln Leu Ala 

Leu Leu Ala Ser 



PCT/USOl/03455 



cys Val 

95 
Cly Leu 
110 

Ser Leu 

12S 

Lys Met 
140 

Pro Glu 
155 

Ala Glu 
170 

Ala He 

185 

His Cys 

200 

Cly Ser 
215 

Lys Thr 
230 

Leu Cly 
245 

Leu Val 
260 

Leu Ala 

275 

Ser Thr 
290 

Ha Leu 

305 

Leu Cln 
320 

Sar Val 

335 

Tyr sar 

350 

Lou His 

365 



Val Thr 
Ser Ser 
Sor Phe 
Leu Vol 
Cya Met 
Cys His 
cys ser 
Pbe Trp 
Thr He 
Asn Val 



Ser Tyr 
Clu Gly 
Ala Val 
Pro Ser 
Phe Tyr 
Leu Arg 
Cln Thr 



Pro 
His 
clu 
Leu 

Leu Ale Ale Met 



Asn Pbe Ala 



Ala Gin Leu 
Met 

Pro 



145 

Gin I 

160 

Leu J 

175 

Leu 1 

190 
l Thr 

205 
i Thr 

220 
t Asn 

235 
i Sar 

250 

Phe 

265 

Phe 

280 

Ser 

295 

Val 

310 

Val 

325 

Thr 

340 

Lys 

355 
Cys 

370 



Asn 1 
Thr 1 
Cys I 
Thr ( 
Tyr I 



i Vol 
t Val 



Gly Met 
10S 
Met Tyr 
120 
ser Thr 
135 
Lys Asn 
150 
Phe ser 
165 
Ala Tyr 
180 
) Val He 
195 
i Tyr lie 
210 

: Leu Arg 
. 225 
i cys Asp 
240 
He Asn 
255 
i Met Pro 
270 



ser He 
28S 

ser Thr 

300 
Ser Ala 

315 
Asp Gin 

330 
Met Leu 

345 
Leu Ala 

360 



No. 747212BCD1 

Glu Leu Ala Leu Ala Sar Cly Asn His Thr Pro 

S 10 IS 

He Leu Gin Oly Pbe Ser Asn Tyr Pro Asp Leu 

20 25 30 

Phe Oly Ala He Leu Leu He Tyr Ala He Thr 



35 



45 



Leu Oly Met Met Ala Leu He Phe Thr Asp Ser 

50 55 60 
Pro Met Tyr Pbe Phe Leu Asn Val Leu Ser Pbe 

65 70 75 

Tyr Ser Ser Val Vel Thr Pro Lys Leu Leu Val 

80 85 90- 

Ser Asp Lys Ser He Ser Phe Olu Oly Cys Val 

95 100 10S 

Pbe Phe Val Val His Val Thr Ala Glu Ser Phe 

110 US 120 

Met Ala Tyr Asp Arg Phe Leu Ala He Cys Gin 

125 130 135 



WO 01/57085 
Pro Leu His Tyr 
Leu val Ala Vel 
Cln Thr cly Asn 
Leu Thr His Tyr 
Cys Ala Asn Thr 
Ala Leu val Thr 
Cy s Leu val Leu 
Arg Clu Lys Asp 
Ala He Phe Tyr 
Gly ser Thr Asn 
Thr He He He 
Asn Lya Glu val 
He Phe Pro Gly 



Gly Ser He 
140 

Ser Tyr Ala 

155 

Val Phe Ala 

170 

Tyr cya Asp 
185 

Ala Thr Ala 

200 

Leu Leu Pro 
215 

Val Ala He 

230 

Leu Ser Thr 
245 

Cly Thr Val 
260 

Asn Thr Asn 

275 

Pro Met Leu 

290 

Lys Oly Ale 

305 



Met Thr 
Phe Cly 
Leu Pro 
He Pro 
Arg vel 
Ala Ala 
Cly Arg 
cys Ala 
Val Phe 
Gly Gin 
Asn Pro 
Leu Cln 



Arg Cly Thr Cys Leu Gin 
145 150 
Cly Ala Asn Ser Ala He 
160 165 
Phe Cys Gly Pro Asn Cln 
175 180 
Pro Leu Leu His Leu Ala 
190 195 
Val Leu Tyr Vol Phe ser 
205 .210 
Val He Leu Thr Ser Tyr 
220 225 
Met Arg Ser Val Ala Gly 
235 240 
Ser His Phe Leu Ala He 
250 255 
Thr Tyr Val Cln Pro His 
265 270 
val Val Ser Val Pbe Tyr 
280 285 
Pbe He Tyr Sar Leu Arg 
295 300 
Arg Lys Leu cln Vol Asn 
310 315 



<210> 17 

<2I1> 312 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 

<400> 17 

Met Asp Thr Cly Aen 
1 5 
Gly Phe Pro His Leu 
20 

Leu Leu Leu He Tyr 

35 

Pbe Leu Val Vel Cys 

50 

Hie Phe Val Ser He 
65 

Ala Thr He Pro Lys 
00 

Thr He Sor Phe Ser 

95 

Ser Leu Gly Ale Thr 
110 

Asp Arg Tyr Leu Ala 

12S 

Met Thr Pro Thr Leu 
140 

Gly Cly Leu Ala Cly 
155 

Leu Pro Phe Cys Gly 
170 

Pbe Pro Pro val Lou 
185 

Val Leu Val Asp Pbe 

200 

Pbe Leu Leu He Leu 

215 



7472134CD1 



Trp Ser 
Gin Cly 
Leu Met 
Leu Asp 
Leu Ser 
Met Leu 
Gly Cys 
Olu Cys 
He Cys 
Cys Ala 
Pro Val 
Pro Aen 
Ser Leu 
Val Ha 
Cys Sor 



Cln Val Ale 
10 

val Cln He 
25 

Thr Val Leu 
40 

ser Arg Leu 

55 

Phe Ser Olu 

70 

Ala Asn Leu 
85 

Leu Leu Cln 
100 

Tyr Leu Leu 
HS 

Arg Pro Leu 
130 

Glu He Ala 

. 145 
Val Clu He 
160 

Arg He Cln 
175 

Ale cys Thr 
190 

Asn Ser Cys 

205 

Tyr val Oln 
220 



Clu 
Tyr Leu 
Oly Asn 
His Thr 
Leu Gly 
Leu Ser 
He Tyr 
Thr Ala 
His Tyr 
He Oly 
Ser Leu 
His Val 
Asp Thr 
Lys He 
He He 



Pbe He He Leu 



30 

Leu Leu He 
45 

Pro Met Tyr 
60 

Tyr Thr Ala 
75 

Glu Lya Lye 
90 

Phe Pbe Hie 

10S 

Met Ala Tyr 

120 

Pro Thr Leu 
135 

Cys Trp Leu 
150 

He Ser Arg 
165 

Pbe Cys Asp 
180 

Ser He Asn 

195 

Leu Ale Thr 
210 

Cys Thr Val 
225 



16731 



17/31 



WO 01/57035 



I PCTAJS01AWJ55 



Leu Arg lie Pro Ser 


Ala Ala Cly Lys 


Arg 


Lys 


Ala 


11a' 






230 




235 










^40 


Cys Ala Ser His Pho 


Thr val val Leu 


lis 


Phe 


Tyr 


Gly 


Sar 


lla 


245 




250 










255 


Lm Sar Het Tyr Val 


cln L«u Lys Lys 


Ser 


Tyr 


Sar 


Lau 


Asp 


Tyr 


260 


265 










270 


Asp Cln Ala L«u Ala 


Val Val Tyr Sar 


Val 


Leu 


Thr 


Pro 


Pbe 


Lau 


2-75 


280 










265 


Ana Pro Phe 11* Tyr 


Sar Lau Arg Asa 


Ly* 


Clu 


Ila 


Lym 


Clu 


Ala 


290 




295 










300 


Val Arg Arg Gin L«u 


Lys Arg I la Gly 


Ila 


Lau 


Ala 








3 05 




310 













<210> 18 
<211> 321 

<2i2> par 

<213> Hotao sapians 

«220> 

<221> talsc.f eature 
<223> Incyta ID NO: 

«400> 18 

Mat Asn Oln Thr Lau 
1 S 
Asn Tyr sar Arg Gly 
20 

Sar Sar Lau Ala Mat 

35 

sar Mat Val Ila Trp 

50 

Pho Cys Ila Tyr Ila 
65 

Lau Pho sar Mat Ala 
80 

Val Asn Thr Thr Asp 
9S 

Tyr Pha Ala Tyr Thr 
110 

Thr Cln Arg cys Lau 
125 

Mis Arg Pro Arg His 
140 

Thr Lau Cys Leu Lau 

155 

Lys Pha Lau Lys Pba 

170 

Val oln Ala Ala Lau 
185 

Lau Sar Sar Leu Thr 

200 

Oln Trp Arg Arg Cln 

Sar Vol Lau Val Pha 

230 

Trp Pba Val Lau Tyr 
245 

Lau Cys Pha Sar Lau 

260 

Ala Asn Pro Val Ila 
275 

Arg Lau Pro Thr Arg 
290 

Arg Glu Glu Pro Clu 
305 

Thr Asn clu Met Cly 
320 



7472136CD1 



Asn 


sar 


Sar Gly 


Tbr 


val 


clu 


Ser 


Ala 


Lou 






10 










15 


Sar 


Thr 


val His 


Thr 


Ala 


Tyr 


Leu 


Val 


Leu 








25 








30 


Pba 


Thr 


cys Leu 


Cys 
40 


Cly 


Met 


Ala 


Cly 


Asn 
4S 


Lau 


Leu 


Gly Pbe 


Arg 


Met 


His 


Arg 


Asn 


Pro 






55 








60 


Leu 


Asn 


Leu Ala 


Ala 


Ala 


Asp 


Leu 


Leu 


Phe 








70 








75 


Sar 


Thr 


Leu sar 


Leu 
85 


Glu 


Thr 


Cln 


Pro 


Leu 
90 


Lys 


val 


His Glu 


Leu 
100 


Hst 


Ly» 


Arg 


Leu 


Met 

105 


val Gly 


Leu sar 


Leu 


LeU 


Thr 


Ala 


He 


ser 








115 










120 


Sar 


Val 


Lau Pha 


Pro 
130 


lie 


Trp 


Phe 


Lys 


Cys 
13S 


Leu 


Ser 


Ala Trp 


Val 


Cys 


Cly 


Leu 


Leu 


Trp 








145 








150 


Met 


Asn 


Gly Leu 


Thr 


Ser 


Ser 


Phe 


cys 


Ser 






160 








165 


Asn 


Glu 


Asp Arg 


Cys 
175 


Phe 


Arg 


Val 


Asp 


Met 

180 


lie 


Hat 


Gly Val 


Leu 
190 


Thr 


Pro 


Val 




Thr 
195 


Leu 




Val Trp 


Val 


Arc 


Arg 


Ser 


Ser 


Gin 






205 








210 


Pro 


Thr 


Arg Leu 


Phe 


Val 


Val 


val 


Leu 


Ala 






220 










225 


Leu 


He 


Cya Sar 


Leu 
235 


Pro 


Leu 


Ser 


He 


Tyr 
240 


Trp 


Leu 


Ser Lau 


Pro 


Pro 


Glu 


Met 


Cln 


Val 






250 










255 


Ser 


Arg 


Lau Ser 


Ser 


Ser 


Val 


Sor 


Ser 


Ser 






265 










270 


Tyr 


Pbe 


Leu Val 


Gly 
280 


Ser 


Arg 


Arg 


Ser 


His 
285 


Ser 


Leu 


Gly Thr 


val 


Leu 


Gin 


Oln 


Ala 


Leu 






295 










300 


Leu 


Clu 


Cly Gly 


Glu 


Thr 


Pro 


Thr 


Val 


Cly 






310 










315 


Ala 



















<210> 19 

<211> 316 

<212> PRT 

<213> Hoao sapiens 

<220> 

<221> »isc_feature 
<223> incyte 10 No: 



PCT/USO 1/03455 



7472142CD1 



Cly Phe ser Gin Tyr 
20 

Ser Leu Val Met Tyr 
35 

He Leu He Thr He 

50 

Leu Phe Leu Gly Asn 

65 

Ala Ser Val Pro Thr 
80 

Thr He He Phe Ser 

95 

Ala Met Gly Sar Thr 
110 

A«p Arg Tyr val Ala 

125 

Met Asn Arg Cys Val 
140 

Thr Gly Cys Leu Thr 

155 

He Pro Leu Cys Cly 
170 

Leu Ala Val Leu Lys 

185 

Thr Ila Met Leu Val 

200 

Leu Leu Val Cys He 

215 

Arg He Thr Ser Ala 

230 

Cly Ala His Leu Thr 
245 

Ser Met Tyr Leu Lys 

260 

Lys He He Ser Leu 

275 

Pro He He Tyr Ser 
290 

Lys Lys Leu Leu Cly 

305 



Pro Cly 
Leu Thr 
Leu Asp 
Leu Ser 
Leu Leu 
Cly Cys 
Glu cys 
He Cys 
cys Ala 
Ala Leu 
Asn Leu 
Leu Ala" 
Val ser 
Ser Tyr 
Clu Cly 
Val Val 
Pro ser 
Leu Tyr 
Leu Arg 
Lys He 



He Trp 

Thr Leu 
ser Arg 
Pbe Met 
Val Asn 
Ala Val 
val Leu 
Asn Pro 
Arg Het 
Leu oiu 
He Asp 
Cys Thr 
He Leu 
He Phe 
Arg Asn 
He Leu 
Ser Ser 
Gly Val 
Asn Lys 
Thr Leu 



ser He Pbe 

10 

Val Val Leu 
35 

Leu Cly Asn 
40 

Leu Lys Thr 

55 

Asp He Cys 

70 

Leu Leu Sar 

85 

Oln Met Tyr 
100 

Leu Ala val 

lis 

Leu Arg Tyr 
130 

Ala Thr Val 
14$ 

Thr Ser Phe 
160 

His Phe Thr 

175 

Ser Ser Leu 
190 

Leu Leu Pro 
20S 

He Leu Ser 

220 

Lys Ala Phe 

235 

Tyr Tyr Cly 
250 

Asn Ala Gin 

26S 

Leu Thr Pro 
280 

Glu Val Lys 
295 

His Gin Thr 
310 



Phe Leu Clu 
15 

Phe val Phe 
30 

Ser Thr Leu 
45 

Pro Met Tyr 
60 

Tyr Thr Ser 
75 

Ser Cln Lys 
90 

Leu Ser Leu 

105 

Met Ala Tyr 

120 

Ser He He 

135 

ser Trp Val 
150 

Ala Leu Oln 
165 

Cys Glu He 
180 

Leu Met Asn 

195 

He Pro Met 

210 

Thr He Leu 

225 

Ser Thr Cys 
240 

Ala Ala Leu 

255 

Lys He Asp 

370 

Het Leu Asn 
285 

Asp Ala Met 

300 

His Olu His 
315 



<210> 20 

<211> 325 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> miec_f eature 

<223> Incyte ID No: 7472171CD1 



Cly Phe 
cys Ser 
Met Ala 
Pbe Phe 
Thr Phe 
Thr He 
ser Thr 
Asp Arg 
Met Ser 
Leu Cys 
Leu Pro 
He Leu 
Glu Asp 



Ser Gly 
Ala Het 
Val Ser 
Leu Cly 
Val Pro 
Ser Phe 
Cly Ser 
Tyr Leu 
His Arg 
Leu Leu 
Phe Cys 
Ala val 
Phe Leu 
i Pbe He 
r Val Pro 



He Pha Mat Tyr 
Asp Olu Val Phe 
Asn Pro Thr He 
Ala Arg Lys Val 
Arg Oly Ser Val 

<210> 21 
<311» 313 
<212> PRT 
<213> Hoao sopiei 

<220> 

<221> aisc.teatu; 
«223> incyte id 

<4 00> 21 
Het Oly Asn Trp 

Pro Ala Leu Leu 

Val Thr Tyr Thr 

Leu He Trp He 

Leu Ser Asn Leu 

Thr Pro Lys Lau 

ser Phe Ala Oly 



Tyr Pro Ala 

Tyr Leu Val 

35 

Val Leu Asp 

50 

Asn Leu ser 

65 

Leu Met Leu 
80 

Ala Val Cys 
95 

Thr olu cys 
110 

Ala lis cys 

125 

Leu Cys Val 
140 

Lys sar val 

Gly His His 

170 

Leu Lys Leu 

185 

Leu Ala Oly 
200 

Cys Leu Ser 

215 

Ser Ala Ala 

230 

Leu Ala val 
245 

Leu Lys Pro 
260 

Thr val Leu 
275 

Tyr Sar Leu 
290 

Trp Gly Arg 
305 

Oln Thr Cln 
320 



Leu Glu 
Thr Leu 
He His 
Thr Leu 
Val His 
Ala He 
Leu Leu 
Oln Pro 
Leu Leu 
Thr clu 
Val Val 
Ala cys 
Ser He 
Tyr Leu 
Arg cys 
Val 

Lys ser 
Tyr Ala 
Arg Asn 
Ser Arg 
val Ser 



His Leu 

25 
Leu Gly 

40 
Leu His 

55 
Asp He 

70 
Leu Leu 

85 
cln Het 
100 

Leu Als 

115 

Leu Arg 
130 

Mat Cly 
145 

Met Val 
160 

Ser His 
175 

cly Asn 
190 

Leu Leu 
205 

Leu He 



235 

Leu Phe 

250 

Lys Glu 

Met Val 
280 

Lys Glu 
29S 

Ala Ser 
310 
Gly 
32S 



Phe Pro Leu 

30 

Thr Ala He 
45 

Pro val Tyr 
60 

Tyr Thr Pro 
75 

Ser Arg Lys 
90 

Leu Ser Leu 
10S 

Thr Ala Tyr 
120 

His val Leu 

135 

Ala Trp Val 

ISO 

He Ser Met Arg 

165 

Phe Thr Cys Lys 
180 

Thr ser Val Ser 

195 

Leu Pro val Pro 
210 

Ala Thr He 

225 

Phe Ser Thr 
240 

Cly Thr He 

255 

His He Ser 

270 

Thr Met Leu 
285 

Lys Clu Ala 
300 

Clu Gly Cly 
315 



No» 7472172CD1 

Sar Thr Val Thr Olu He Thr Leu He Ale Phe 
S 10 15 

Olu He Arg He Ser Leu Phe Val Val Leu Val 
20 25 30 

Lou Thr Ala Thr Gly Asn He Thr He He Ser 
35 40 4S 

Asp His Arg Leu Gin Thr Pro Met Tyr Phe Phe 
50 55 60 

Ser Phe Leu Asp He Leu Tyr Thr Thr Val He 
65 70 75 

Leu Ala Cys Leu Leu Gly Clu Olu Lys Thr lie 



Cly 


Thr 


Val 


Clu Phe 


He 


Leu 


Leu 


Ala 


Val 


Met Ser 


Phe Asp. Arg 






110 










115 




120 


Tyr 


Met 


Ala 


He Cys 


Asp 


Pro 


Leu 


His 


Tyr 


Thr Val 


He Met Asn 






125 










130 




135 


ser 


Arg 


Ala 


cy* Leu 


Leu 


Leu 


Val 


Leu 


Cly 


cys Trp 


val Cly Ala 






140 










145 




ISO 


Pbe 


Leu 


Ser 


Val. Leu 
1SS 


Phe 


Pro 


Thr 


He 


Val 
160 


Val Thr 


Arg Leu Pro 
165 


Tyr 


cys 


Arg 


Lys Clu 


He 


Asn 


His 


Phe 


Phe 


Cys Asp 


He Ala Pro 






170 










175 




180 


Leu 


Leu 


Gin 


Val Ala 


Cy» 


He 


Asn 


Thr 


His 


Leu He 


Glu Lys He 








185 








190 




195 


Asn 


Phe 


Leu 


Leu Ser 

300 


Ala 


Leu 


Val 


He 


Leu 

205 


Ser Ser 


Leu Ala Pha 
210 


Thr 


Thr 


Cly 


Ser Tyr 

215 


Val 


Tyr 


He 


He 


Ser 
220 


Thr He 


Leu Arg He 

325 


pro 


Sar 


Thr 


Cln Cly 


Arg 


Gin Lys 


Ala 


Phe 


Ser Thr 


Cys Ala Ser 








230 








235 




240 


His 


lie 


Thr 


Val Val 


Ser 


He 


Ala 


His Gly 


ser Asn 


He Pbe Val 








245 










250 




255 


Tyr 


Val 


Arg 


Pro Asn 


Gin 


Asn 


Ser 


ser 


Leu 


Asp Tyr 


Asp Lys val 




260 










265 




270 


Ala 


Ala 


Val 


Leu He 

275 


Thr 


Val 


Val 


Thr 


Pro 
280 


Leu Leu 


Asn Pro Pbe 
285 


He 


Tyr 


Ser 


Leu Arg 

290 


Asn 


Glu 


Lys 


val 


Gin 
295 


Clu Val 


Leu Arg Olu 
3 00 


Thr 


Val 


Asn 


Arg He 

305 


Met 


Thr 


Leu 


He 


Gin 
310 


Arg Lys 


Thr 



«210> 22 
<211> 1413 
<212> DMA 

<213> Homo sapiens 
<230> 

<221> mlsc_f eature 

<223> Incyte ID No: 7472033CB1 

<220> 

<221> unsure 

<222* 176, 383, 927 

<223> a, t, c, g, or other 



acggtgcacg 
aggccgctgg 
tggtaccaca 
acccttggtt 
t tea tec tea 
tcagaeetac 
tactggccct 
tgtattttcg 
gtgtacccca 
tgygcactgg 



gcgtgegcta 
ccctcaacac 
acccggtgct 
tgctcateag 
gttategece 
acctggccat 
ttttatgect 
ecggctcctg 
tgtcgacaat 
cgcgggacag 
cactgctgct 
cactcctgca 



ctggtgccca 
cgcatcaccg 
accaactgcc 
tttttcaacc 
tacgccatct 
tggcteaacg 
aatgtggctc 
cgcaagtatg 
agageggtae 



tectgategt 
tggtggctgt 
tactgateag 
tgtacgcgga 
gecacatgat 
icttecg 



ccagctggca 
tctggacgac 
cactgccatc 
tatcgatagg 
catgtaegge 
gtcatccgga 
atncggccaa 
agteaecatg 
etccatcctg 
atccataacg 
ectgeagtte 
ggee t eg c eg 
geagategge 
gegettctae 
ctcggtggca 
geaggenegg 
catcgccatc 



taaagaaatt 
cacgacaggc 
tetccaaagg 
gaggeggtet 



gatggggage 
cggcatccge 
aacacctcgg 
ttcctgagca 
gtgetaateg 
agcccatcat 
agcaagtgtg 
ecgetgacct 
tgeaaaaega 
gccattgcct 
gtgggcgcgg 
ctgttcgtet 
ctgeaggaca 
tactcgatct 
tactteggga 
aaggtggagc 
atctttggcg 
tcgccggtca 
tccgcctgct 
caagaactge 
tgeaaegtee 



atctgagtgg 
tggactacet 
atctgaatga 
acagggeggt 
tetteggege 
gcgcactgct 
agtctcatce 
tgatggagat 
ttgecatget 
tcgacagata 
tgacgatcct 
acaaggagct 
cgatcccgta 
tctcgctgtg 

ggggycggcg 

tttcttggct 
etcagagcat 
ccaacccgtt 
tctgeegttg 



atacgccagc 
tgacctgggc 
gactgngagc 
ggacagcccc 
cctaggcaac 
cgeaatctgg 
gageggaett 
cctgtccaag 
gcaggcactt 
t c egg t gate 
ggeggggate 



etgeattgag 
egtacaatae 
gctgaagagc 
gatgaagege 
gccgcttgac 
gctagtccgc 
gctctacggc 
ctcagaeact 
gcgeaggcgt 



120 
180 
240 
300 
360 
420 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



2GY31 



21/31 



gaggggggac tcgccaacat agt tea teat tga 

<210> 23 

<2U> 1076 

<212> DUX 

<213> Hoeao sapi»na 

<220> 

<221> misc_t«atura 

<223> Incyta ID NO: 7472034CB1 



<400> 23 
tctaeccaaa 
taacacctcc 
ctgaattaee 



PCT/US01/Q3455 



tttacctcct 
cctto caeca 
cctcctccae 
acategtctg 
tact tgtggc 
caatcctgca 
tactccttat 
taggecatge 
tcaatcgagc 
ttggtgtttc 
gacttaaggc 
ctggaatttt 
ttcttctggc 
tgaagactca 



tccaggttta 
tgctttagtg 
atctatgtac 
tgcacccaaa 
cctgatccag 
catggctctg 
tccagggytc 
ccccttcccc 
ctattgtgaa 
ttatgggetg 
ctatgcccac 
gtttagcaca 
ctccttectc 
cacaeggtat 
gcagatccgc 



ggcaatgtta 
ctcttcctgt 
gcccttgcag 
atgttcttca 
gategctatg 
atagggegea 
attttgttgg 
catatggctg 
actatggect 
atccCccagg 
tgtggctctc 
actcaccgct 
ctcctcatge 
cagcgagtgc 



attctggtaa 
eagcaatgga 
caattattaa 
agcactggat 
ccattetctt 
ccatgctagc 
tgctcctggt 
tecatgeatt 
tagecatttg 
tcggaatggt 
gaacacttat 
tCgtgaaact 
tgcttgtgat 
cagtgctgaa 
atatttgtgt 
ttggtcatca 
cacctgcgct 
tcagagtgtt 



tattacaaaa 
tccaaaccaa 
gctgatcaaa 
tgcactgccc 
cateatctgg 
tgccatcgac 
tcatgcccac 
ctcctccatg 
tcaccccttg 
ggtgctggtg 
cttctgccaa 
tgcctgctca 
tgggctggat 
ggtaccaggg 
catcctggtc 
tgtaccccat 
caatcccctt 



caaggttctt 
gatgaaatct 
gaggcaccag 
ctgggcatcc 
atggacccat 
ctggttctgg 
gagattgggt 
gagtcagggg 
caccatteca 
aggggattac 
gccaccatca 
gaaaccacag 
gttctggcca 
agtgaggccc 
ttctatgtcc 
catgtccatg 
gtctatggag 
gattga 



60 

120 

180 

240 

300 

360 

420 

480 

S40 

600 

660 

720 

780 

840 

900 

960 

1020 

1076 



<210> 24 

<211> 948 

<212> DBA 

<213> Hobo » apian* 

<220> 

<221> rai*c_f«aturo 

<223> Incyta ID No i 747203SCB1 

<400> 24 
atggaaaccc 
tggctggcta 
gtgactgcaa 
ctggctgctg 
tgctcaggag 
ttagccacag 
atetgeaage 
atggccatca 
ctacctttet 
aggttagcat 
atggtgggct 
tttggtctct 
ggggttatgg 
caggatgtag 
gccaccttaa 
agttatctga 



ctgcctcctt 
tctcactgag 
tctggatgga 
tggacattgt 
acagctcaat 
ctgtggagac 
ctctacacta 
ccatcagagc 
gtggctccaa 
gtgctgaccc 
ctgatgtggc 
cctcaaagac 
ctttgtacta 
tgcccttgca 
otcccatcat 
tgcatgtcct 



cetccttgtg 
tgccatgtac 
ttccactcgg 
tatggcctcc 
cagctttagt 
ggggetgctg 
eaagagaatt 
tatcacagcc 
tgtggttgtc 
cgtgcccagc 
ettcattget 
tgetcagtcg 
tctacctggg 
cacecaagtc 
etatggcatg 
ctttgaccat 



ggtatcccag 
ateatagece 
eatgagceca 
tcggtggtac 
gcttgtttca 
ctgaccatgg 
ctcacgcctc 
ataaetccac 
cactcctact 
agtctctaca 
gcctcctata 
aaagcactaa 
atggeatcca 
ctgctagctg 
aggaceaaac 
tccaacctgg 



gaetgeaate 
tgttaggaaa 
tgtattgett 
ccaagatggt 
ctcagatgtt 
cttttgaccg 
aagtga tgc t 
tgagttggat 
gtgagcacat 
gtctgattgg 
tcttaattct 
gcacatgtgg 
tetatgegge 
acctgtacgt 



gttcatga 



ttcacatctt 60 
caccatcatc 120 
tctgtgtgtt ISO 
gagcatcCtc 240 
ttttgtccac 300 
etatgtagee 360 
gggaatgagt 420 
ggtgagtcat 480 
agetttggee S40 
ttcctctctt 600 
caaggcagta 660 
ctcccatgtg 720 
ctggttgggg 780 
gatcatceca 840 
gagaatatgg 900 



<210> 25 
<211> 945 
<212> CHA 
<213> Hosao sapiens 

<220> 

<221> rai*c_£eatura 
<223> Incyta ID Not 



<400> 25 

atgtcagcct 

ggcctggaac 

ctgcttggaa 

atgtacctct 

eccaaaatgc 

geccagatgt 

gcctttgacc 

tccctcatca 

ctcecctccc 

tgtgaacaca 

ggcatcgctg 

atctttatte 

gggacatgcg 

teagtcatge 

ttctatctgc 

ateegtgaga 



ccaataCcac 
ace tgc neat 
eetgeactet 
ttctggccat 
ttgeeatatt 
tcttccttca 
gcta tgtggc 
ecaagattgg 
tgctgagatg 
tggctgtggt 
tggccatgtt 
ttcaggcagt 
tctctcatat 
accgtgtagc 
tcttcccacc 
gcatcttggg 



cttaacacat 
ctggatctcc 
ccttctcatc 
gttggcagce 
ctggttcagg 
ctccttctcc 
tatctgeaag 
catggctget 
tttceactac 
gaggctggcg 
tattgtggtg 
tetactgett 
aggtgecate 
ccgccatgct 
catggtcaat 
agtattecea 



ccaactgcct 
atccctttet 
atccaggctg 
atcgacctgg 
gategggaga 
atcatggagt 
ccactgcact 
gtggcecggg 
tgccgaggcc 
tgtggggaca 
ttggacctgc 
gcctctcagg 
ttagecttet 
gcccctcatg 
cccataatct 




ggggattcca 60 
tacactggcc 120 
ccatgaaccc 100 
ctcagcactg 240 
tgcctgtctg 300 
gctggccatg 360 
cctgactggg 420 
aatgactcca 480 
tcactgctoc 540 
caatatctat 600 
cctgtcttat 660 
caaggcattt 720 
ggtcatctct 780 
ecttgeeaat 840 
900 
945 



<210> 26 
<211> 966 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eatura 

<223> Incyta ID No: 7472037CB1 



<400> 26 
atggcccatc 
ctgc tggtgg 
ctcaccCacc 
cgcactetcc 
tacacactgg 
tccatcgcgg 
ttcctcctta 
tctgtgatca 
ttctcagtgg 
catgtggagc 
tcactcaagg 
ttcattctcc 
ggccggcgca 
ggttgtgcct 
ctggtggcgg 



aggegectga 
gattctccaa 
tggtcacact 
acacacctat 
tcaccatccc 
getgteggge 
ccetgatggg 
tgagacccac 
ccttgattga 
acttcttetg 
gacttggcat 
tctcctacgc 
aagccttctc 
ccatcatcta 
tgttctacac 
aggtgagggt 
aagtaa 

<210> 27 

<211> 996 

<212> DMA 

<213> Homo sapians 

«230> 

<221> nisc_f«atttra 
<223> Incyta ID No: 

<400> 27 
atgagtcctg 
aatctcaaca 
ctgacaggca 
atgtacttct 
ecaaagatge 
acccaattct 
tetgeggate 
gctgtgtget 



gaagcagcag 
cctcccagaa 
cagtggcaat 
gtaccgcttc 
caatatgetg 
ccagatgttc 
ctacyacagg 
cgtctgcctg 
gaectgeatg 
tgacattgcg 
cttcttcctg 
cttcattgtg 
tocctgcgca 
ectgaggecg 
cgtggtgaca 
agegctgagg 



gacaatggga 
ctgaggccca 
gccaccatca 
Ctggccgtgc 
gctcatctgc 
ttcttcctag 
tatgtggcca 
tgtttgggag 
atcttctcct 
cctgtgctga 
agcatcctcg 
getgecattg 
gcccacctca 
gactctgggg 
ccgctgctga 



cctggctggt 
ctctcttcat 
t caeca teat 
tctccctctc 
tgatggagag 
gcttgggttg 
tctgccatcc 
ccttggtttC 
cacccttctg 
agctcagctg 
tggtgctggt 
tgaggatccc 
egg tgg teat 
ctaatccctc 
accctgtggt 
caeggggctg 



7472039CB1 



atgggaacca 
gcgcaagagt 
atgtgttgat 
ttctgggtaa 
tgagcaattt 
atttctactt 
gctacctggc 
ttcgtgtggc 
ctgtggcctt 



i ccctggcctc 



cagtagtgat 
ggaattattt 
tgtgggggtg 
cctgtcctgc 
cctctcaagg 
etttcteggg 
catctgtcat 
cttggcctgc 
gcttcctttc 
gctccgcctg 
cctcgtcatt 



ccaacagagt 
tctgtgttte 
gtaagggctg 
ctagagatac 
caacacacta 
gcctccgagt 
cctctgcgct 
tgggtggggg 
tgtaagcagg 
gettgeacca 
gtatcttcct 



tcgtcctggc 
ttcttgtcta 
atactcgact 
tgctcacttc 
tttcctttgc 
tcttactgtt 
accccttgct 
gactcgtccc 
gtgetgtggt 



gacagagttc 60 
cttgttcctc 120 
ccaggtggat ISO 
tgagacctgc 240 
ccaggccatc 300 
cagecattgt 360 
cttgcgctac 420 
ctgctctggt 480 
tgyegcagge 540 
tgatgagagc 600 
ctccttcctc 660 
tteggectet 720 
cgcacatttt 700 
ccaggaccgc 84 0 
ttacaccctg 900 
tggogcattt 960 



agggctccca 60 
tctcctgaat 120 
acagacccct 1B0 
tgtcatcatt 240 
tgcatgtatc 300 
ggctgtcatg 360 
catgagtggg 420 
tgtgcttggt 480 
acagcacttc, 540 



tgctgatcac tgetgegtec 660 



WO 01/57083 



PCT/USO 1/03455 



tacggcctca ttgtgctggc agtcctgagc atcccctctg cttcaggccg teagaaggee 720 
ttctctacct gtacctccca cttgatagtg gtgaccctct tctatggaag tgccattttt 780 
ctctatgtgc ggccatcgca gagtggttct gtggacacta actgggcagt gacagtaata 840 
acgacatttg tgacaccact gttgaatcca ttcatctatg ecttaegtaa tgagcaagtc 900 
aaggaagctt tgaaggacat gtttaggaag gtagtggcag gcgttttagg gaatctttta 960 
ctbgataaat gtctcagtga gaaagcagta aagtaa 996 

<210> 28 

<211> 1014 

<212> ONA 

<213> Homo sapians 

<220> 

<221> nise_Csature 
<223> Incyta ID No: 

<400> 28 
atggggaacg 
gtggactgcc 
ctgtatgccg 



7472040CB1 



ttgctgtgct 
ccgtatggrtg 
agcgtcctgc 
tggtggtcta 
ctggccttgc 
ccagcccggc 
actgccatcc 
agtgccctcc 
gggttttttg 
ccgaactccg 
ctcgctcaca 
eggt cactgc 



attctgtcag 
tggatggcgc 
ccatcttcct 
cccgccggag 
gtttgtctct 
cagtgggctg 
tec tgg cage 
cggttcagcg 
tgcteaeegt 
tgcagtgtgt 
ggtttetttt 
tgtgctgggc 
tetgetggge 
cactcctggc 



ctacgagtat 
etgcctggcc 
ggtgggggtg 
ggtgggtgce 
gcccatcctg 
tegggegctg 
tctcagtgcc 
ggcgtgcggg 
gccctccgec 
ggtggactac 
tggcttcctg 



cagggccctg 
tcccatgctc 
tgggee 



gtggacagca agaaatccac cagccatgac 

<210> 29 

<211> 5122 

<212> DNA 

<213> Homo sapians 

<220> 

<22l> misc_faatur« 

<223> Xncyta ID No: 4250893CB1 

<400> 29 

cceggggete cactctgggi 
acctcttctc 
aagaatctga 
ctcccctctg 
gctgacttct 
accactcccc 
egaatgetga 
ctgataccct 
ggctggcgga 
gaaggagttc 
acccgctcag 



ggggattaca 
atcgacccgc 
ccgggcaatg 
acctggttgc 
gcagtgccca 
ccctccatca 
gacctctgct 
gtgcaggtgg 
atctaccgcc 
ggcggctcct 
gggcccctgg 
tgccggccgc 
ctgctggggc 
egggctgaac 
ttcctgtatt 
ctgagggagt 
ctggtctcgg 



gcgacctctc 
tgcgcgtggc 
ccatggtggc 
tccacctggc 
ttgcccgtgg 
tcctgctgac 
tcctggctct 
cctgtggggc 
ggetgeacca 
ccagcaccga 
tggccgtggc 
tgggcacagc 
tggtgcteae 
ccctcatcgt 



ggaccgccct 
cccgctccca 
ctgggtggct 
cgtggcggat 
aggecactgg 
catgtatgee 
cgggcctgcc 
agee tggaca 
ggagcacttc 
gaatgcggtg 
cagctgccac 
cattgtggtg 
tgtggeggce 
gggccttgcc 



sgatggaggt gtag 



ctcccacccc 
caggagaggg 
gtcaccacat 
ctccctatca 
ggcttttcct 
aactgggttc 
ctctgcctgc 
ggacagagat 
aggaggagct 
egggtgagae 
tggctcagta 
ggctgaggtc 
acageccaag 
tgcectggat 
gacagctgcc 
aagctcttcc 



actgtgcgcc 
tgaaccccac 
ccatgaattg 
tgacacatct 




taacaccctc 
tecagtgtct 
ctctatcggc 
ataagagacg 
caccacctgg 
gcctggggcc 
eagcttctge 
ccagagaaga 
cgggcctggg 
caagcccagg 
etggggtctc 
ccggcggggt 
ggacgccgct 
gaecatgagt 
aflctggggat 
caatggcctc 
gagaaatcag 
cacaccagcc 
tccacatggc 



tetgaaagee 
ctcaagggtt 
acagtgagcg 
cctgcctggc 
tgcaggcccc 
gtctatctgt 
gtaagageca 
gaacatcccc 
gcctggcetg 
tegecaataa 
cgagtggcat 
acctagccct 
aaggggcagg 
ccagccaacc 
gcacccaggg 



cattccctgc 
gatctcagga 
ttgcccacac 
cctggacccc 
tetgggctcc 
aggtgecctg 
aggcaaagga 
cagacaccct 
ttacttggag 
agegeactce 
ggaggtggcc 
ccatacctgg 
ccactctccc 
cacctccacc 
ctcagagaga 
ctctgcctca 
ggagtcaccc 
gcccagccta 
gtcaggaatt 
gcccccatgg 
tacagcccca ccaccaccca 
ectggaaaaa tgaggatttt 



tctgcggctc 
gtccaggacc 

atcccaggac 
tccctcccct 
ggatttataa 
cagcactgtt 
cttaactccg 
ttcctgaaga 
aggagetctt 
tegtacctgg 
gagcagatgg 
tcattcccct 
gcagtgctaa 
agggttttga 
cacgtctacc 




120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1014 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

102O 

10BO 

1140 

1200 

1260 

1320 



PCT/US01/03455 



tcacacagct 
gaagctggcc 
ttggcecagg 
ttgggaagtc 
gggaccgett 
tgagtctege 
tcctgtctag 
tettgeagga 
cactgctggg 
ctcggaccac 
tcctggggtt 
ggcaagcaat 
ttgtgccctg 
aaaaactcac 
ctctccaagc 
gcatctggca 
gggecatcat 
gctacagctt 
aggatgagaa 
aagcatatgg 
taagtgatga 
ggaacctgat 
ccctccactg 
tcgaagaaat 
ttaaatccag 
catggagccc 
agattctett 
actegctgag 
tcctggagac 
cctttgggct 
eggatgetgg 
gaetgeaget 
ttagtgccag 
gcgtgcgact 
tggaccagac 

tggatacggg 
agagggegge 
tcccaattgc 
gcgtgccttc 
acttctgggg 
aecgagttca 
tgatgagaga 
agatcaaccc 
ctggagctgt 
acacatccct 



gctacttcta 
tgattatgtg 
cctggatacc 
aacactggcc 
ccagcatgtc 
tgagctcatc 
gecagagegg 
gocgagttct 
cagtttgctg 
agetctgeag 
Ctctgagtcc 
tagagecttt 
ggtgtcctgg 
actgacttcc 
teagecattg 



ctccaccttc 
cattcacctc 
ggpgagaggt 
aatacatggc 



tcctgaaaat 
accaccgcgt 
ecatteggaa 
cacccccgct 
ggatgctgga 
agctgttctc 



caactactct 
tgetgeaett 
ttgtcttgga 
ctgagaacac 
ggaagtgcaa 
aactctggga 
ccagcccttg 
ctctgcaaac 
gatgecttge 
ctacagcatc 



gcagtgggtc 
cttgtatgag 
gggcatgtgt 
ccgccacgtg 
caeca tgg ta 
ctccgtcctc 
ccactctgca 
cctgcggttg 
gagagecaac 
agccaaacac 
ggtcagctgt 
ccccagcctg 
gctctgtgag 
aactctgagt 
gctcatcttc 
agagatgagt 
ttcccatgtt 
tgagattgea 
tcctgcctct 
ccccacgggg 
cttccetgta 
ageggtgace 
acagcacagc 
ggaagctgtg 
gttccaaatg 
gctgcatcac 
gatccataat 
ceatcctgag 
ggecatagat 
gaccccactt 
aatactcccc 
ggagtcctac 
tgagactctg 
gatccctcca 
tgtggaccag 
caaactgeat 
gaggeceage 
agatggactc 
gaagggcagc 
acccttgagt 
aagttgccat 
tgggcattat 
tcagggatgt 



:ctc 
gaggagaatc 
caagaacctc 
aggcaggtga 
ttc tact tea 
ggaaaagatg 
ctgctcttca 
gagctctgtc 
gggaaaacta 
aacctcattc 
agcaggaagg 
aggttggtca 
ctggcctgca 
aagaccacca 
ggaccccagc 
cttttcagtc 
ttgaagatgg 
tgtttccaag 
aaacattcta 
ctgtttgggg 
gagatggaga 
ccgtccctgc 
acteggaaca 
gtagaaacag 
aagaagct tc 
gtcctgttca 
aaggtcacca 
gtgaagagtc 
gctggctgtg 
cagaccctga 
ctttge« 



ccaagatccc 
aattgagatc 
actgeagggg 
ggggagaggc 
gctggcccag 
tccggctccc 
tgtagatgag 
ecagccacag 
ggcatccttc 
geaggcaegt 
cagatatttc 
agagctctgg 
gcagcagatg 
tctacattac 
ctgctctctg 
caggaagcat 
agagcacccc 
agcaatgtcc 
agatttggaa 
aegtttccta 
ctgccggctg 
gcagccacae 
gacaeaagtg 
cttagtgtgc 
gggcaggcag 
agtcacagat 
ggagctggac 



ggcctcacgt 
aaggagctag 
gggctcaggc 
gatgagatga 
ageagaegga 
aatagcacat 
gctcaggcta 
gaggaaagct 
caaggggacc 
cctgtggcta 
gctggctcct 
gttgagattg 
tggatggtgg 
cacctccctc 
gcccacttta 
atagttctgg 
gccctgcget 
gaagtcacct 
gatctagaaa 
tatatgggct 
aaggaactgg 
gttggccact 
gtgtgggagg 
gcccgcatag 
tatcgagagc 
ggacaggtgc 
cagatgegga 
taccaagccc 



accccagaag 
gaggacattt 
gcatagtcat 
aggaagectg 
getgeagaga 
ggacagccac 
tcctcgatgg 
tgcactggag 
tactteccga 
cttctctgga 
aa tatttcta 
aatcaaaeaa 
ettgectgat 
caaccctctg 
tcagagacct 
cagatgacct 
gtattcttca 
agttctttgc 
attgeatcat 
catcaaccac 
acatctttca 
agetgetget 
aaacgttcct 
acatggagct 
age t gattga 
gg tgggtccc 
gaaacctgaa 
tttgtaagac * 
gcctcacagc I 
ccgagctgga « 
gactgagaca < 
Ctgactgctg 
acctgcagca 
atcctgcctg 
ggcaggaact 
aaccaagtgt 
cctcactcaa 
atctcaaact 
ccccagaggt 
tgcatacgaa 
ctgaggtagt 
accgctggcc caacaegggt 
aattctgtgt gtgggaccag 
cagggcctct gctggacatc 
actttgtggc tctccaaggg 
gatgetcctg 
cttctccccc 
caectctgtg 
cctgatccca 
gtttgtgega 
tgtgtctggg 
tcgaagccct 
gatcaggctg 
aecaggagat 
acctctggat 
ccgagtgaca 
gcagtacgag 
ettgagecag 
ccatcctcac 



ccaggacctg 
gaacaacctg 
caaactcata 
gagggecctg 
gatgacccct 
geggcagaga 
cctggacgtg 
agtaccggtg 
gcctttgggg 



ctggtcaaga 
agagacttat 
gctgctggaa 
cagctgtatg 
tccaaggtgg 
attagacaga 
ccaggatggg 
ccggcggatg 
ctgatcaegg 
tgggtagagg 
acagatgaaa 
gccctgtgtc 
aageggaagg 
cttgcccagg 
getgetgagg 
gggttagatg 
atccctctga 
tatgtcttgg 
aagaegctag 
ttgggcctgt 
tctcagggga 
tctctggagt 
atggcccatt 
actttctgea 
.cacagateaa 
gectattgge 
Ctaagtggaa 
cctcgetgcc 
aaggaccttg 
aatgtgctca 
aagctacagc 
gcctctgtgc 
gatgacgttg 
cgcctggggc 
gagcaggaga 
actgagggee 
cteggatcag 
agcaagatct 
gaactcttgt 
ac tgacgatg 
aagaacttgt 
etctgetttg 
ttcetgggtg 
aaggctgagc 
ggceatgtgg 



teattcccgt 
tccacctcta 
tgaaattcca 
gtcgttacac 
agetctgeta 
egggatcagg 
ccttggtgaa 
ccgtaccttc 
agctgatagc 
tgagccagga 
agctgttcag 



cctggctttg 
ctggtttgcc 
gtgtccagcc 



tcctgccact 
gctgaccctt 
ttccagcact 
agggatgeae 



ctttgggtct 



ttgggagtcc 
gtgttgcttt 
agtgactget 
atccacaagc 
tctggttcag 
ggagaagaec 
caagtgaaag 
ctcatgcctg 
gccccgeagt 
tcggtggagg 
agggtgctgg 
tcctgggacc 
ctcattatgg 
agtatcaaca 
cagtttcttt 
aacttatgat 
gtcaggtggg 



1380 

1440 

1500 

1560 

1620 

1680 

1740 ' 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2560 

2640 

2700 

2760 

2820 

2S80 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660- 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4S60 

4620 

4680 

4740 

4800 

4B60 

4920 

4960 

5040 

5100 

5122 



24/31 



23/31 



gcaaccnacn 
atactagctg 
taaatgtgga 



<220> 

<221> mi«c_f catura 
<223> Incyta ID No: 



<400> 30 

atttatggag 

ttgagcctag 

aagggtttga 

gtgaggtgtg 

agctctgaan 

ccagcccaag 

cggagcacee 

cccteacagg 

ccatgtactt 

tgcccaaggc 

tggcccagct 

tggcctatga 

aggtgttctg 

tccacacggg 

tctgcgaggt 

tgattgtcct 

atggcttcat 

tctccacctg 

cctacataag 

acactgtgct 

aagcagccct 



6726656CB1 



tcacatgaag 
gatgatgagt 
agaataccgg 

taatgtcctc 
tttcttactc 
gctggccagt 
etatttcetc 
ccggtacgca 
cagcgggctg 
gctgatgctg 
ccctcccctg 
ggcggatgct 
cgtctccagc 
ctcttcccac 
cccggtctec 



aattacacag 
tgtcacCaat 
aaataaccgc 
gggagcagcc 
ctgtggatgg 
aeccagacgt 
gtgttcttat 
accaccttgg 
aacttggcta 
ctggtgtcgg 
acgtgggctg 
gccetctgee 
gccacagccg 
cgcttggatt 
ctgcttetct 
ttctacggca 
atcctgaaoa 
ctcaccgtgg 



caatgaattc 
actctgtgtg 
tgtccgaata 
ttcetcagga 
agagtcacct 
tggtaaccga 
teagctgttt 
ceatcacgtt 
ctatggacat 
aagagagctc 
catccceaga 
accegctgca 
tgtggctgct 
tctgtggccc 
cctgcagctc 
tagtgaactt 
tgaagac tgc 
tgtgcatgta 



tggaccgtag 
accaggaagc 
ccctgttggc 
cacctcttgt 
gatagtccca 
gttcatectg 
ectetteetc 
eaacectggg 
tatctgcacc 
catctcctac 
gctgctgctc 
ttacagcagc 
ctgcgccgtc 
caatgtcatt 
cacetacgtc 
cctgatgace 



tcccacagcc 60 
ttgagtgccc 120 
ctgctgtgat ISO 
ttatctctct 240 
gaaacccgtc 300 
eagggctttt 360 
taetctgggg 420 
Ctccacgctc 480 
tettccatca S40 



ttccctttct I 



ctcacggtca 660 
720 
780 
840 
900 
960 ' 
1020 
1080 
1140 
1200 
1241 



aacacggcca 
atccatttct 
aaeggtgtca 
atcgcgtcct 
cagaaagcct 
gtcttctacg 
ggcctgctgt 



<210> 31 
<211> 1155 
<212> DNA 

<213> Homo aopirao 



<221> mi oc„f *atur* 

<223> Incyta ID No: 7472062CB1 



<400> 31 

atgaatgtcc 

agcctacagt 

agtgttgaga 

agagtacctt 

atcctgcttg 

ctcatctacc 

tetcatctcc 

tactcctcag 

tcctacagtg 

tacctcttgt 



gttaccttga 
ggataccaga 
ata&ctgtca 
attgcatctg 
aggaacaaag 
gattcagtct 



ttctggcaga 
eagcccccaa 
gtaatgacag 
gtgattctaa 
gtctaaccaa 
tcaceactct 
acactcccat 
ctgtcactcc 
cctgtgctgc 
ccteaatgge 
tgacaacacg 
cttctatcca 
actttttctg 
agttgattet 
tttcctatct 
agcetttate 
tcatcatgta 
tgttctacac 



ttcaaattca 
gactaegaac 
gaagaatgtg 
aataacatcc 
tgccccagaa 
gactgggaac 
gtactttttt 
aaaggtttta 
tcagatgttc 
ctatgaecgc 
tgtgtgtgct 
aattggagat 



ctccaaccet 
tctaagataa 
atggagaata 
ctecaggttc 
ctggggatga 
Ctcagtaacc 
actgggt tgc 
ttttgtgcag 
tacgcagcag 
tgtctggcta 
acatttcgcc 




ttCaa 



tcaagtttta 
atcaccattc 
tctcacctca 
agttccngtc 
cccatgctca 
aagyttgttg 



ttgtgeataa 
caatatctga 
aaggggattg 
atacagaggt 
ccctctttat 
tcatattaat 
tgtctettge 
ttatagaaga 
tctttgccac 
tgtgtaaccc 
taggctgtta 
tctctttctg 
ctctgacctg 
atgtcttttt 
tta 



acacatctge 
tatectgcta 
t ttcaacaca 
gagtgaattc 
catgtttacc 
cctgctggac 
aggcattggt 
caaagccatc 
tgtggaaaat 
cctacattat 
tgtcattggt 
catgtccaat 



120 
180 
240 
300 
360 
420 
480 
540 
600 




<210> 32 
<211> 1260 
<212> DA 

<213> Homo sapians 



<221> mi»c_f eeture 
<223> Incyta ID No: 7472067CB1 

<400> 32 
atgctggcag 
ctccactttg 
gctctcttgg 
ctccttcaca 
agcctggctg 
aaaagtgttt 
tgcatggcag 
agcgacccag 
atctggactg 
catgaaggtg 
atgtttggta 
tatttctgga 
aaccagatac 
ctcttgtggc 
ccggccccac 
latcctc 



ctgcctttgc 
ccggagggta 
tggctgtctg 
atgcttggaa 
atctctcect 
gggatctagg 
ccaagagcct 
ccaagcaagt 
tggetagcet 
tggaaatgtg 
agctctaccc 
gagcttatga 
gctcaaagca 
tccccgaatg 
eacaaggttt 
tcatttttct 



agactctaac 
cctgccctct 
cctggtgggc 
aggaaagcca 
cctgetgttc 
ctggtttgtc 
gacaatcgtt 
gagtatceac 
gttacccctg 
cctcgtggat 
actcetggca 
ccaatgtaaa 
agtcacagtg 
ggtagcttgg 
catagccctg 
tgtgatgtcg 

tgacaaggtt 
tecctcetct 
gcagttttgg 
ggaacatgaa 



<210> 33 
<211> »4S 
<312> DNA 

< 2 1 3 > Homo sapians 
<220> 

<221> nisc_featur« 

<223> Incyta ID No: 7472072CB1 



tccagcagca 
gattcccagy 
ttcgtgggaa 
tccatgatcc 
tctgcaecta. 
tgcaagtcct 
gtggtggcca 
aactacacca 
ccggaatggt 
gtaccagc tg 
tttggccttc 




atgctgctga 
ctgtgggtat 
tctcaagtct 
gaagagttca 
actgtctcag 
ccatctccag 



tgaatgtgtc 
actggsgaac 
acctgtgtgt 
actccctgat 
tccgagctac 
ctgactggtt 
aagtatgett 
tctggtcagt 
tctttagcac 
tggctgaaga 
cattattttt 
ctaagaetca 
gcattgccat 
ggcate tgaa 
tgatgtttte 
gggaaggctt 
agtctcagga 
aatccccagc 



ctttgctcac 
catcatcccg 
gattgycatc 
tctgaatcte 
ggcgtactcc 

catgtatgea 
gctggtggcc 
catcaggcat 
gtttatgtcg 
tgccagcctt 
aaatcttaga 
catctctgct 
ggctgcaggc 
catctcttca 
gaaaggtgta 
aacaccagct 
atccatacca 



<400> 33 

atgggggatg 

tcaggatcac 



ttcctgctca 
Btggcatcag 
atattcttcc 
gaccgttatg 
tgtctgctga 
tecatcaccc 
gtgccagcce 
acctcagggg 
gtgttgcagg 
tgctcctcgc 
gtgccetgcg 
gtcaccccta 
ttggtcaaag 



tgaatcagtc 
gccagctcct 
ttcttctctt 
gccagctetc 
actttctgcg 
tcaeactgat 
ttgctgtgtg 
tgatgggctc 
tgcattttcc 
taetgaagct 
tgctgatcct 
ctgttctaag 
acatcacggt 
cctaceacag 
cactcaaccc 
tgcttagcag 



ggtggcctca 
cttctccctg 
cttgatcegt 
cctgtttgac 
gggagaaggt 
gggtgtggct 
eeageecctg 
ctcctgggtg 
ctactgtgcc 
cteetgtgea 
aatgctccct 
catgcgctca 
agtggggctc 
tccacagcag 
ccttatctae 



gacttcattc 
gtggctgtca 
gtggactccc 
attggctgtc 
gccacctcct 
gagggcgtcc 
cagtatcctg 
gtaggtgtgc 
tcccgtattg 
gatacctgtg 
ctttccctca 
gaggaggcca 
ttttatggtg 
gataacgtgg 
agtctgagga 



tggtgggcct 
tgtttgtcat 
ggctccatac 
ccatggtcac 
atggaggtgg 
tgttggtcct 
tact tat gag 
tcaacgcctc 
tggatcactt 
cctacgagat 
tcgccaccte 
gacacaaggc 
ccgccgtgtt 
ttteectett 
atccggaggt 
gctga 



60 

120 

180 

240 

300 

360 

420 

480 

54 0 

600 

660 

720 

780 

840 

900 

960 

1020 

10BO 

1140 

1200 

1260 



cttcagtcac 60 
aggccttctg 120 
acccatgtac 1BO 
catccccaag 240 
tgcagctcaa 30O 
catgtcttat 360 
acgccaggta 420 
catccagacc 480 
cttctgtgag 540 
ggcgccgtcc 600 
ctacggccac 660 
tgtcaccacc 720 
catgtacatg 780 
ctatagcctt 840 
gtggatggct 900 
945 



<210> 34 
<211> 765 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> miac_feature 

<223> Incyta ID Ho: 747207 4CB1 

<400> 34 

atggcctccc ggtatgtggc agtgggaatg atcttatcac agaccgtggt gggagtcctg 60 
gggagcttct ctgttcttct ccattatctc tccttttact gcactgggtg caggttaagg 120 
tccacogatt tgattgttaa gcacctgatt gtagccaact tcttagetct ccgctgtaaa 180 
ggagtcceec agacaatggc agcttttggg gttagatatt ttctcaatgc tettgggtgc 240 



gtaaacatca 
aacgaggatt 
gcaatgttgt 
tccatggttt 
ctctccccca 
acctttgtgt 



tctatctcca 
tccaggtgat 
ccaagcatgt 
tctttcccet 
tgggatactg 
tatcattecc 
gcatcctgca 
gagcctcccc 
cttettaeac 



tagagtgggc 



tggcttttct 
gtatgtgact 
ttctggggga 
tgatgtgttg 
caggcacaag 



ccteteetge 



PCTAJS01/03455 

aggggagtgt ccattggcac cacctgcctc 300 
tccaggaaat ccaggtgggc aaaacttaaa 360 
gttctcctgt gctggatcgt gtgcatgttg 420 
ggcaaatgga actacacaaa catcacagtg 460 
ggcaacaaca aaatcgcaca gacactgcgt 540 
tgtctggggc tcatgctctg ggtcagcagc 600 
cagcgggtcc agcacattga taggagcgat 660 
gctacgcaga gcatcctcat cctggtgagc 720 
cttttccaag tttga 76S 



<210> 35 
<211> 10B9 
<212> DMA 

<213> homo sapians 
<220> 

<221> miac^faatura 

<223> Incyta ID No: 7472077CB1 



<400> 35 

Btgtacaagg 

ccatggggca 

ctcttagcat 

cccacccagc 

atcatcgagc 

getetctgtt 

tgtgtctcet 

atcattattg 

atgacaccct 

atggccctca 

cagcatggaa 

atctccatgc 

gtcgtctgca 

gagetetgca 

cccgtcacag 

tggaaggtct 

ggagctgagg 

gatcccacac 

ggagtataa 



actgcatcga 
tcattetgga 
ttctcttcct 
tcctcttcct 
ccaatcaaca 
tctcatgcct 
tctcctggac 
ccactgagta 
gccagctcaa 
cattcttcgt 
ggctcatctt 
tcctgagagg 
ttgctctggt 
ttctctacag 
cctaceaaca 



gtccactgga 
gtccctggcc 
catgcgaaag 
cctgagtgtc 

cttagctcat 
gacaattctg 
tgtgactctc 
tgtggacttt 
ctccaaagcc 
tatcactgtg 
caacccgcag 
caccaacgca 
atcgtgtaga 
cagcttccaa 
ggacttccta 



gaccattttc 
atacttggca 
atecaagact 
ctggggctcr 
gtacgctoct 
gectccaate 
tgcattgcta 
atcatgacea 
gttgrtactcc 
accttctgtg 
ctcttctcca 
ttccagcgac 
tgggttttcc 
caggagtgcc 
gtggr 



ttctctgtga 
tegtggtcac 
gcagccagtg 
tcggactcgc 
ttctctttgg 
tagtgaagct 
ttggttgcag 
gaggtatgat 
tggtctatgt 
gcccgtgtga 
tcatcatctg 
agecccagtg 
tgctgctgta 
ctttacaagg 
aggagctetc 



cgccgagggg 
aattctgcta 
gaatgtcctc 
tttcgccttc 
ggttctcttt 
ggttcggggt 
tetgttgcaa 
gtttgtgaat 
cctcttcctg 
gaactggaag 
ggtggtgtgg 



120 

1B0 

240 

300 

360 

420 

480 

540 

600 

660 

730 

780 

840 

900 

960 

1020 

1080 

1069 



<210> 36 

<211> 1334 

«212> DNA. 

<213> Hono aapiana 

<220> 

<221> misc.Caatura 
<223> Incyta ID Not 

<40O> 36 
atgtgtaaat 
gcatggcagc 
aattccaaga 
gtggaccctg 
ccagagctcc 
ggcaacttgg 
tatttcctca 
atgc tggtga 
ctctatttat 
gactgttatg 
tgcttctggc 
agtcttatgt 
cttttccctc 
gtcttgagtg 
atcattgcca 
tgcagctccc 
cagccatcat 



7472082CB1 



gcttcagaag tggcaatagc 
ccaggaaggc cccaagaaca 
" gtacattaag 
ctcagtgact 
cttcctcctg 
actgattggg 
cttcattgac 
agagaagaae 
tgctattgca 
cagccccttg 
agtttacatt 
tttgtgcaag 
ctcctgctce 
cctgatgcct 
cattcactcc 
tgttgetgtt 



gaaaccattc 
agctgcgcct 
geatgatcac 
gcagtctgtc 
actttgcgac 
tcagtatttt 
ttgccatctg 
tcacagtggg 
tgagactctt 
tcttggggct 
catttaacat 
gcatcctccg 
acatcttggc 
ctgtcagctc 



actccagtcc 
cagcaaactg 
gaaaatttca 
gagtccattc 
ttcttaggaa 
ctcagttctc 
ttctgccatt 
atcatctcct 
gagtgtcaca 
ctgtacaatg 
ttaggcatcc 
actaatgtga 
agcacctaca 
gccttaacca 



tgtgteaecg 
acatgggtta 
aaaagactgt 
tggctgggct 
tctgtgtggt 
acctgcacac 
ccoctgtcat 
accctgaatg 
tgttggctgc 
tcatcatgte 
ttggatctac 
ttaaccatta 
tcaatgaatt 
tccttgcttc 
ggtccaaagc 
ctgcagcatt 
cgtctgtgtt 



tgacaagatc 
ctcagaacag 
cacagtggtg 
acctatgtae 
tacccctaag 
catggctcag 
aatggcgtat 
ctateaeeac 
aattcatacc 
tttttgtgat 
actggttctg 
ttacatcttt 
cttcagcact 
catgtacctg 
ttatactact 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



WO 01/5708$ 



PCT/BS0MJ3455 



attgcgccca tgctgaaccc cctgatctac agcctgagga acaaagatgt caaacctgcc 1080 
gtgaagaaaa ttctgcatca gacagcatgt taatgaatag aatcaatgtt atgttgttac 1140 
atcaagatag gtctttggtt tgattagata tctaacttat tggatttatt gttgagattt 1200 
atgaaaattt agtgatgctc ttttatgtaa cacctctcca aaatattcct ccggtctgct 1260 
tccatcgaac ttatattcca atgagcatat gtaaagaaat acaaagaata aaatcaaaag 1320 
acttttgagg ttta 1334 

<210> 37 

<211> 960 

<212> DNA 

<213> Homo sapians 

<220> 

<221> mlsc.faatura 
<223> Incyta ID No: 

<400> 37 
atgacacctg 
ttgcagggat 
atctatgcca 
catctccaaa 
tcttctgtgg 
tttgagggct 
ctgctggcct 
tctatcatga 
gccaactccg 
ctaacacact 
acagcaagag 
attctcacct 



747212BCB1 



gagaactagc 
tctccaatta 
taacagtggt 
gcccaatgta 



ccttgccagt 
tccagacctc 
gggcaacttg 
tttcttcctc 
gctcttggtc 
gctcgccttc 



ctatccagac 
aetactgtga 
tggtcctcta 
ectactgctt 
acctctccac 
tcacctatgt 



ctgtctccag 
tggaaatgtc 
eataccaccc 
tgtcttttct 
ggtcttggtg 

CtgtgCCtCC 

tcagccccat 
cataattccc 
tctgcagagg 



ggcaaccaca 
caggagcttc 
ggaatgatgg 
aatgtcctct 
aacttcctgg 
tttgtagtgc 
ctagccatct 
ctggtagctg 
tttgccctgc 
cttctccacc 
gctctggtca 
gccattggga 
cactttctgg 
ggatctacta 
atgctcaatc 



ccccagtcac 
tcttcggagc 
cactcatctc 
cgtttcttga 
tctctgacaa 




<210> 38 

<211> 939 

<212> DNA 

<213> Hono sapians 

<220> 

<221> mlsc_faatura 
<223> Incyta ID No: 



<400> 38 

atggacacag 

cagggtgtcc 

ggaaacctgc 

cactttgtca 

atgc tggeaa 

atctatttct 

gataggtatt 

tgtgcagaga 

tccttgattt 

ttccctcctg 

gttataaatt 

atcatctgca 

tgtgcctccc 

cage tg nag a 

ctcacaccct 

gtgaggaggc 



<210> 39 
<211> 96B 
<212> DNA 
<213> Hono sapiai 



agatttatct 
tgatattcct 
gcat tctctc 
acttgetcag 
ttcactccct 
tagecatctg 
ttgccattgg 
cacgcctcec 
tgctgagttt 
ectgeaagat 
cagtgctcag 
acttcactgt 
agagctactc 
tcctcaaccc 



7472134CS1 



ccaggtagca 
cttcctcttg 
ggtggtctgc 
cttctcagag 
tgagaaaaag 
tggagegact 
ccggcccctc 
ctgttggttg 
attctgtggc 
ggcttgeact 
cetagccacc 
aattccctea 
ggttctcatc 
actggactat 
cttcatctac 
aattgggata 



gaat'tcatca 
ttgettctca 
ctggactccc 
cttggctata 
accatttcat 
gagtgetate 



ggaggcttgg 
cccaatcgca 
gataegtcta 
ttcctgctga 
gctgccggca 
ttctatggga 
gaccaggccc 
agettgegea 
ttggcatga 



tettgggctt 
tttacctcat 
ggcttcacac 
cagctgccac 
tctctgggtg 
tcctgacagc 
ccctcatgac 



ttcagcaegt 
taaatgtcct 
tcctctgctc 



caagttcatc 60 
catcctgctc 120 
cacagactce 180 
tatttgttac 240 
gtccatctct 300 
tgagagctte 360 
ccattatggt 420 
atttggtgga 480 
gcccaaccag 54 0 
caacacagcc 600 
tgctgcagtc 660 
agtagcaggg 720 
tttctatggc 780 
tggccaagta 840 
tagcctecgc 900 
tcccggctga 960 



cccccatctc 60 
gactgtgttg 120 
acccatgtac 180 
catccctaag 240 
tctcctgeag 300 
tatggectae 360 
cccaacactt 420 
sgttgaaatt 480 
cttttgtgac 540 
agtagatttt 600 
ctatgtgcag 660 
catctccacg 720 
catgtatgtg 780 
ctactcagtg 840 
900 
939 



28731 



29/31 



WO 01/570*5 



<221> nitc_fMtur« 
<223> Incyta ID No: 



• 



PC17US01rt>3455 



PCTAJS01A134S5 



<400> 39 

ggatgaacca 

ggagcacagt 

gcgggatggc 

ecttctgcat 

cttccacgct 

tgntgaagag 

gcacccagcg 

aectgtcage 

cctcttcctt 

tggtceaggc 

ccctctttgt 

tcgtggtggt 

actggtttgt 

tgtcaegcct 



gactttgaat 
gcacacggcc 
aggcaacagc 
ctatatcctc 
cagcctggaa 
actgatgtae 
ctgtctctct 
ctgggtgtgt 
ctgcagcaag 
cgccctcatc 
ctgggtgegg 
cctggcctct 
gctctactgg 
ctcctcgtcc 



agcagtggga 
toectggtgc 
atggtgatct 
aacctggcgg 
acccagcccc 
tttgcctaca 
gtcctcttcc 
ggcctgctgt 
tcettgaaat 
otgggggtct 
aggagctccc 
gccctggtgt 
ttgagcetgc 



ccg tggagtc 
tgagctccct 
ggctgctggg 
cagccgacct 
tggtcaatae 
cagtgggcct 
ctatctggtc 
ggaeactccg 
teaacgaaga 
taaccceagt 
agcagtggcg 
tcctcatctg 
cgcccgagat 
gcgccaaccc 
ggtccctggg 



agccctaaac 
ggccatgttc 
ctttcgaatg 
cctcttcctc 
cactgacaag 
gagcctgcCg 
caagtgtcac 
tetcctgatg 
tcggtgcttc 
gatgactctg 
gcggcagccc 
ttecctgcct 
gcaggtcctg 
cgtcatctae 
gactgtgctc 



atttggagca 
ttcgtettca 
otcactatcc 
ttcatggata 
tcccagaaaa 
ggctccacag 
aacccgctga 
tcccgggtga 
ctctgtggga 
tgcacaagtt 
attccaatgc 
acctcagcag 
attttgtatt 
aaaatagaca 
atttacagtt 
ittgcatc 



tttttttctc 
gccttgtaat 
tagattcacg 
tttgttaeae 
ccattntctt 
agtgtgtgct 
gatactccat 
cgggttgcct 
atctcatcga 
cactgctcat 
tcttagtttg 
agggaagaaa 
atggggctgc 
aaatcatctc 



ttcgcgagga gcccgagctg gaaggtgggg agacgcccac cgtgggcacc 
gggcttga 

<210> 40 
<211> 1000 
<312> DNA 
<213> Homo aapiana 

<220> 

<221> mi»c_f oatur* 
<223> Incyta ZD Ho: 7473142CB1 

<400> 40 
attaaatgaa 
aaacttcacc 
agtggttctc 
tcttattttg 
aaatctctct 
cttgctgtca 
ccttgccatg 
ggccatttgt 
ggccacggte 
gcagataccc 
aaagttagct 
eetcttgcca 
tctgagaatc 
gactgtggtg 
aaatgcacaa 
gaaccccata 



tattccagag 60 
acctgcctgt 120 
cacaggaaee lBD 
ttcagcatgg 240 
gtccacgagc 300 
acggccatca 360 . 
cggcccagge 420 
aacgggttga 480 
agggtggaca S40 
tccagcctga 600 
acaeggctgt 660 
ctgagcatct 720 
tgcttcagct 780 
ttcctggtgg 840 
caacaggcge 900 
aatgagatgg 960 
968 



tatagatgaa 
ggagggattt 
gtatctgaca 
ccttaaaacc 
atctgcctct 
ttetgggtgt 
cctggccgtg 
catcatgaac 
gaccgctctg 
tcacttcacg 
gaacaccatc 
cacctcttac 
caaggctttt 
cctctctatg 
gttgctttac 
ggaagtcaaa 
acatctctga 



tcccagtacc 
aegctcttgg 
eceatgtact 
gttcctaett 
gctgtacaga 
atggcatatg 
aggtgcgtct 
ctggaancca 
tgtgaaattc 
atgctggtgg 
atcttcatcc 
tctaec tgtg 
tacctaaagc 
ggagtgctta 
gatgctatga 



tgcaaggaga 
cagggttaga 
gcaacagcac 
tattccttgg 
tgctggtgaa 
tgtatctgtc 
accgttatgt 
gtgcacggat 
gttttgccct 
t ggcgg t gc t 
tcagcattct 
tttccactat 
gtgcccattt 
cttcttcatc 
cccctatgtt 
agaaattget 



120 
180 
240 
300 
360 
420 
480 
540 
600 
6E0 
720 
780 
640 
9 DO 
960 
1000 



7472171CB1 



<210> 41 
<211> 1008 
<212> DNA 
<213> Horn© 

<220> 

<221> misc_fnatur« 
<223> Incyta ID No: 

<400> 41 

ctgtggacca cctcttcaga actctgcagc atggagccgc tcaacagaac agaggtgtcc 60 

gagttctttc tgaaaggatt ttctggetac ccagccctgg agcatctgct cttccctctg 120 

tgctcagcca tgtacctggt gaccctcctg gggaacacag ceatcatggc ggtgagcgtg 180 

ctagatatcc acctgcacac gcccgcgcac tccttcctgg gcaacctctc taccctggac 240 

atctgctaca cgcccaeett tgtgcctctg ntgctggtcc acctcctgtc atcccggaag 300 

accotctcct ttgctgtctg tgccatecag atgtgtctga gcctgtccac gggctccacg 3 60 

gagtgcetgc tactggccat cacggcctat gaccgctacc tggccatctg ccagccactc 420 



aggtaccacg tgctcatgag WnTaWygctc tgcgtgctgc tgatgggagc tgcctgggtc 480 
ctctgcctcc tcaagtcggt gactgagatg gtcatctcca tgaggctgcc ettctgtggc 540 
caccacgtgg tcagtcactt cacctgcaag atcctggcag tgctgaagct ggcatgcggc 600 
aacacgtcgg tcagegaaga cttcctgctg gcgggctcea tcctgctgct gectgtaccc 660 
ctggcattca tctgcctgtc ctacttgctc atcctggcca ccatcctgag ggtgccctcg 720 
gccgccaggt gctgcaaagc cttctccacc tgcttggcac acctggctgt agtgctgctt 780 
ttctacggca ccatcatctt caCgtacttg aagcccaaga gtaaggaagc ccacatctct 840 
gatgaggtct tcacagtcct ctatgccatg gtcacgacca tgctgaaccc caccatctac 900 
agcctgagga acaaggaggt gaaggaggcc gccaggaagy tgtggggcag gagtcgggcc 960 
tccagtgagg gagggcgggg ctctgtacag acgeaggtet caggttag 1008 

<210> 42 
<211> 972 
<212> DMA 

<213> Hotno sapitma 
<220> 

<221> mis c_r eat urn 

<223> Incyta ID Not 7472172CB1 

<400> 42 
ggaaaccetg 
accctaatcg 
gtaacttaca 
catcgcctgc 
tacaecactg 
tcttttgctg 
atcctcttgg 
a egg tea tea 
ttcctgtctg 
attaatcatt 
c tea t tgaga 
actaetgggt 
cgtcagaaag 
agcaacatct 
gccgcCgtce 



cccaccatat 
ccttcccagc 
cattaacagc 
aaactecaat 
tcattacccc 
gttgeatgat 
cggtgatgtc 
tgaacagcag 
tgttgtttcc 
tcttctgtga 
agataaactt 
cctaegtgta 
ctttttctoc 
ttgtgtatgt 
tcatcacagt 



agtagttgtc 
tctcctggag 
aacaggaaac 
gtacttcttc 
aaagttgttg 
ccaaacatat 
ctttgaccgc 
ggectgeett 
aaccattgCa 
cattgcccct 
tctcctctct 
cataatttct 
ctgtgcttct 



atgggaaact 
attcgaatat 
atcaecatca 
ctcagtaatt 
gcctgcctcc 
ttctacttct 
tacaeggcta 
ctgctggttc 
gtgacaaggc 
cttcttcagg 
gcccttgtca 
accatcctgc 
cacatcactg 



ggtgocccct ctcctgaacc 



aatgagaagg tacaggaagt gttgagagag acagtgaaca 
aggaaaactt ga 



ggagcactgt 
ctctcttcgt 
tctccctgat 
tgtcctttet 
taggagaaga 
ttctggggac 
tctgcgaccc 
tgggatgctg 
taccttactg 
tggectgtat 
tcctgagetc 
gtatcccctc 
ttgtctccat 
cactggatta 
cttttatcta 
gaatcatgac 



gactgaaatc 60 
ggttcttgtg 120 
atggattgat 180 
ggatatctta 240 
gaaaaccata 300 
ggtggagttt 360 
actgcactac 420 
ggtgggagce 460 
taggnnngan S40 
aaatactcac 600 
cctggeatte 660 
cacccagggc 720 
tgcccacggg 760 
tgacaaggtg 840 
cagcttgagg 900 
cttgatacaa 960 
972 
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SEQ ID NO: 11; SEQ ID NO: 12 

page 14, line 1 - line 14; figures 5,6 

page 15, line 15 -page 16, line 3 

DATABASE EHSL (Online] 

Accession no. AC010111, 

14 September 1999 (1999-09-14) 

HUHZV D.M. H AL.: "Drosophila 

melanogaster clone RPC198-9B18, *** 

SEQUEttCIKG IN PROGRESS ***,' 

XP802175188 

-/" 



eg • 



1-19. 
22-28 





17 August 2001 



1 6 It. 01 



Schdnxasser, 0 



! 1 of 2 



INTERNATIONAL SEARCH REPORT 



DATABASE SHALL [Online] 

Accession no. Q9VNM1, 

1 Hay 2088 (2088-85-81) 

ADAMS H.D. ET At.: 'The genome sequence of 

Drosophile oelanogaster; CG1147 PROTEIN • 

XP882175189 

WO 01 31005 A (PHARMACIA & UPJOHN CO) 
3 Hay 2081 (2001-05-03) 



SEQ ID NO: 3; SEQ ID NO: 6 

DATABASE SMALL [Online] 

Accession no. 097505, 

1 Hay 1999 (1999-05-81) 

I TO Y. ET AL.: •Sus scrofe NPY Y4 gene for 

neuropeptide Y receptor type 4, complete 

cos; NEUROPEPTIDE Y RECEPTOR TYPE 4 

(NEUROPEPTIDE Y- FAMILY RECEPTOR Y4)" 

XP882175198 



1.3. 
6-18. 
12-16. 
18.19. 

22-27 



1-19, 
22-28 



page 2 of 2 



INTERNATIONAL S 



PCT/US 01/83455 



»Artktet7<2Xa)lor»*lo 



. fx] CbrirasNuj 



Although claims 18 and 24 are directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the alleged 
effects of the compound/composition. 

m ctafcmi*^ 28.21 not searched;23.24 searched incompletely 

see FURTHER INFORMATION sheet PCT/ISA/218 
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. f7] Mo r»K»-«: Kfc!i^r-J Man* Wl timaiy B» apfflcac*. Con.wr.wOy. this k- 

1-19.22-28 (all partially) 
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Form PCT/IS/W1 0 (cor*™*** et M ihnt (1)) (July »998) 



PCT/US 81/83455 



No. PCT/US 01 /03455 



FURTHER INFORMATION CONTINUED FROM PCT/TSA/ 210 



This International Searching Authority found multiple (groups of) 
Inventions in this international application, as follows: 

1. Claims: 1-19,22-28 (all partially) 

Invention 1: 

An Isolated polypeptide comprising the amino acid sequence 
of SEQ ID N0:1 or honologues or fragments thereof; an 
isolated polynucleotide encoding said polypeptide; a 
recombinant polynucleotide comprising a promoter sequence 
operably linked to said polynucleotide; a cell transformed 
with said recombinant polynucleotide; a transgenic organism 
comprising said recombinant polynucleotide; a method for 
producing said polypeptide; an isolated antibody which 
specifically binds to said polypeptide; an isolated 
polynucleotide comprising the polynucleotide sequence of SEQ 
ID NO: 22 or honologues or complements thereof; methods for 
detecting a target polynucleotide in a sample, wherein said 
target polynucleotide has a sequence of above 
polynucleotide; a pharmaceutical composition comprising an 
effective amount of above polypeptide; a method for treating 
a disease or condition comprising administering said 
composition; methods for screening a conpound for 
effectiveness as an agonist or antagonist of above 
polypeptide; a composition comprising an antagonist conpound 
identified by said screening men t hod, a method of treating a 
disease or condition comprising administering to a patient 
said composition; a method for screening for a compound that 



specifically binds to above polypeptide; a method of 
screening for a conpound that modulates the activity of 
above polypeptide; a method for screening a conpound for 



effectiveness in altering expression of a target 
polynucleotide, wherein the target polynucleotide comprises 
above Isolated polynucleotide; a method for assessing 
toxicity of a test compound involving at least a fragment of 
above polynucleotide. 

2. Claims: 1-19.22-28 (all partially) 
Inventions 2^21: 

Inventions no. 2-21 relate to subject-matter as defined 
above for »1nvent1on no. l", with the exception, that each 
of the inventions refers to one of the polypeptide sequence 
of SEQ ID N0:2-SEQ ID N0:21 as mentioned in claim 1 (and to 
one of the corresponding nucleotide sequences SEQ ID NO: 23- 
SEQ ID N0:42), but not to the polypeptide sequences SEQ ID 
N0:1 (or to the corresponding polynucleotide sequence SEQ ID 
NO: 22) . 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



^Continuation of Box 1.2 
Claims Nos.: 20,21 not searched; 23. 24 searched incompletely 



ig a irue leoimcai cnsracienianon wt saig 
> such compounds are defined in the application. In 
t of said claims is ambiguous and vague, and their 
sufficiently disclosed and supported {Art. 5 and 6 



Claims 20 and 21 refer to agonists identified by a method described in 
claim 19 without giving a true technical characterization of said 
agonists. Moreover, ^ - 

consequence, the sc< . 
subject-matter is not sufficiently c... 

PCT). No search can be carried out for such purely speculative claims 
whose wording is, in fact, a mere recitation of the results to be 
achieved. 

Similarly, present claims 23 and 24 relate to antagonists identified by a 
method described in claim 19 by reference to a desirable characteristic 
or property, namely the property of being identifyable by a method as 
described in claim 22. 

The claims cover all antagonists having this characteristic or property, 
whereas the application provides support within the meaning of Article 6 
PCT and/or disclosure within the meaning of Article 5 PCT for only a very 
limited number of such antagonists. In the present case, the claims so 
lack support, and the application so lacks disclosure, that a meaningful 
search over the whole of the claimed scope is impossible. Independent of 
the above reasoning, the claims also lack clarity (Article 6 PCT). An 
attempt Is made to define the product by reference to a result to be 
achieved. Again, this lack of clarity 1n the present case is such as to 
render a meaningful search over the whole of the claimed scope 
impossible. Consequently, the search has been carried out for those parts 
of the claims which appear to be clear, supported and disclosed, namely 
those parts relating to antibodies against the polypeptide of claim 1 as 
antagonists for said polypeptides (please see page 40, line 32 - page 41. 
line 4). 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
International preliminary examination (Rule 86.1(e) Pa). The applicant 
is advised that the EP0 policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



INTERNATIONAL SEARCH REPORT 




WO 9623889 


A 


08-88-1996 


US 


5621079 A 


15-84-1997 








CA 


2212225 Al 


88-88-1996 








EP 


0889648 Al 


83-12-1997 








JP 


11500610 T 


19-61-1999 








HO 


9623809 Al 


88-68-1996 








US 


5939263 A 


17-88-1999 


WO 9131005 


A 


03-05-2001 


AU 


1218601 A 


88-85-2001 






WO 


0131885 A2 


03-05-2001 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
-□"FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



